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PRELIMINARY AMENDMENT 



BOX PATENT APPLICATION 

Assistant Commissioner for Patents July 26, 2000 

Washington, DC 20231 

Sir: 

The following Preliminary Amendments and Remarks are 
respectfully submitted in connection with the above-identified 
application . 



IN THE SPECIFICATION: 

Please amend the specification as follows: 

Before line 1, insert — This application is the national 
phase under 35 U.S.C. § 371 of PCT International Application No. 
PCT/EP99/00478 which has an International filing date of January 
27, 1999, which designated the United States of America. — 

IN THE CLAIMS: 

Please amend the claims as follows: 



Claim 


2: 


Line 


1, 


change 


"claims 1 


-2" 


to 


— claim 


1 — 




Claim 


3: 


Line 


1, 


change 


"claims 1 


-2" 


to 


— claim 


1 or 


2 — 


Claim 


4: 


Line 


1, 


change 


"claims 1 


-2" 


to 


— claim 


1 or 


2 — 


Claim 


5: 


Line 


2, 


change 


"1-5" to 


— 1- 


-2 — 








Claim 


8: 


Line 


1, 


change 


"claims 1 


-4" 


to 


--claim 


1 — 





AMENDMENTS 



Docket No. 471-162P 



Claim 10: Line 1, change "claims 1-4" to — claim 1 — 
Claim 11: Line 1, change "claims 1-4" to — claim 1 — 
Claim 13: Line 2, change "claims 1-4" to — claim 1-- 

REMARKS 

The specification has been amended to provide a cross- 
reference to the previously filed International Application. The 
claims were also amended to delete improper multiple claims and 
to place the application into better form for examination. Entry 
of the present amendment and favorable action on the above- 
identified application are respectfully requested. 

If necessary, the Commissioner is hereby authorized in this, 
concurrent, and future replies, to charge payment or credit any 
overpayment to Deposit Account No. 02-2448 for any additional 
fees required under 37 C.F.R. § 1.16 or under 37 C.F.R. § 1.17; 
particularly, extension of time fees. 



Respectfully submitted, 
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471-162P 
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concern identified above with regard to the invention, entitled Recombinant proteins 

d erived from HGF and MSP ; b y Enzo MED ICO, 

Paolo MICHIELI , Chiara COLLESI , Gianfranco CASELLI , Paolo COMOGL IO 

inventor(s) 



described in 



C 1 
I I 
[ ] 



the specification filed herewith 

application Serial No. 

Patent No. 



filed 
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If the rights held by the above- identified small business concern are not exclusive, each individual, 
concern or organization having rights to the invention is listed below* and no rights to the invention are 
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R EC OMBIN AEHLJPRQTEIHS^^ 

Field of the invention 

The present invention relates to recombinant proteins obtained from 
the combination of structural domains derived from the a subunits of 
hepatocyte growth factor (HGF) and macrophage stimulating protein 
(MSP). 

In particular, the engineered factors of the invention are obtained by 
combination of the hairpin loop and kringle domains of the a chains of 
HGF and/or MSP, so as to obtain a structure having two superdomains 
with an intervening linker sequence. Moreover, the invention relates to 
DNA sequences encoding the above mentioned recombinant proteins, to 
the expression vectors comprising said DNA sequences and to host cells 
containing said expression vectors. The recombinant proteins of the 
present invention are biologically active, and their activity can be 
measured by determination of their ability to induce activation of the Met 
tyrosine kinase receptor, their "scattering" effect on epithelial cells, and 
their protective effect against cell death induced by chemotherapic drugs 
(vide infra). Therefore, these molecules can conveniently be used to 
prevent or treat the toxic side effects of the chemo therapeutical treatment 
of tumours, and to reduce iatrogenic cell damage induced by other types of 
drugs. 

Technological background 

Hepatocyte Growth Factor (HGF) and Macrophage Stimulating 
Protein (MSP) are highly related proteins both structurally and 
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functionally (Fig. 1 and 2). Both these factors are secreted as an inactive 
precursor, which is processed by specific proteases which recognise a 
cleavage site inside the molecule, dividing the protein in two subunits. 
These subunits, named a chain and p chain, are linked by a disulphide 
5 bond. Thus, the mature factor is an a- (3 dimeric protein. Only the mature 
(dimeric) form of the factor is able to activate its' receptor at the surface of 
the target cells (the Met tyrosine kinase in the case of HGF and the Ron 
tyrosine kinase in the case of MSP) and therefore to mediate biological 
responses (Naldini, L. et al., 1992, EMBO J. 11: 4825-4833; Wang, M. et 
10 al., 1994, J. Biol. Chem. 269; 3436-3440; Bottaro, D. et al., 1991, Science 

25: 802-804; Naldini, L. et al., 1991, EMBO J. 10: 2867-2878;- Wang, M. 

et al., 1994, Science 266: 117-119; Gaudino, G. et al., 1994, EMBO J. 13: 

3524-3532). 

The a chain of both factors contains a hairpin loop (HL) structure 
15 and four domains with a tangle-like structure named kringles (K1-K4; 
Nakamura T et al., 1989, Nature 342:440-443; Han, S. et al.., 1991, 
Biochemistry 30: 9768-9780). The precursor also contains a signal 
sequence (LS) of 31 amino acids (in the case of HGF) or of 18 amino 
acids (in the case of MSP), removed in rough endoplasmic 

20 reticulum, which directs the neoformed peptide to the secretive 
pathway. The P chain contains a box with a sequence homologous 
to that typical of serine proteases, but it has no catalytic activity 
(Nakamura T et al., 1989, Nature 342:440-443; Han, S. et al.., 1991, 
Biochemistry 30: 9768-9780). Both a and P chains contribute, to the 

25 binding of the growth factor to the respective receptor (Met for HGF and 
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Ron for MSP). 

HGF and MSP polypeptides are able to induce a variety of 
biological effects besides cell proliferation. The main biological activities 
of these molecules are: stimulation of cell division (mitogenesis); 
5 stimulation of motility (scattering); induction of polarisation and cell 
differentiation; induction of tubule formation (branched morphogenesis); 
increase of cell survival (protection from apoptosis). The tissues that 
respond to HGF and MSP stimulation are those where cells express the 
respective Met (HGF) and Ron (MSP) receptors. The most important 
10 target tissues of these factors are epithelial cells of different organs, such 
as liver, kidney, lung, breast, pancreas and stomach, and some cells 
of the hematopoietic and nervous systems. A detailed review of the 
biological effects of HGF and MSP in the various tissues can be 
found in Tamagnone, L. & Comoglio, P., 1997, Cytokine & Growth Factor 
15 Re-views, 8: 129-142, Elsevier Science Ltd.; Zarnegar, R. & 
Michalopoulos, G., 1995, J. Cell Biol. 129: 1177-1180; Medico, E. et 
al., 1996, Mol. Biol. Cell, 7: 495-504; Banu, N. et al., 1996, J. 
Immunol. 156: S2933-2940. 

In the case of HGF, the hairpin loop and the first two 
20 kringles are known to contain the sites of direct interaction with the 
Met receptor (Lokker NA et al., 1992, EMBO J., 11:2503-2510; 
Lokker, N. et al., 1994, Protein Engineering 7: 895-903). Two 
naturally- occurring truncated forms of HGF produced by some cells 
by alternative splicing have been described. The first one comprises 
25 the first kringle (NK1-HGF Cioce, V. et al., 1996, J. Biol. Chem. 
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271: 1311 Ot 1311 5) whereas the second one spans to the second 
kringle (NK2-HGF Miyazawa, K. et al., 1991, Eur. J. Biochem. 197: 
15-22). NK2-HGF induces cell scattering, but it is not mitogenic as 
the complete growth factor is (Hartmann, G. et al., 1992, Proc. 
5 Natl. Acad. Sci. USA 89: 11574-11578). However, NK2-HGF re- 
gains . mitogenic. activity.... in ... the .... presence, . of .. heparin, a 
glucosaminoglycan that binds HGF through a domain contained in 
the first kringle and which is likely to induce dimerization of NK2- 
HGF (Schwall, R. et al., 1996, J. Cell Biol. 133: 709-718). Moreover 
10 NK2-HGF, being a partial agonist of Met, behaves as a competitive 
inhibitor of HGF as far as the mitogenic activity is concerned (Chan, A. et 
al., 1991, Science 254: 1382-1385). NK1-HGF has also been described to 
exert partial stimulation of Met and competitive inhibition of HGF 
mitogenic activity (Cioce, V. et al., 1996, J. Biol. Chem. 271: 13110- 
15 13115). Anyway, a truncated factor is endowed with an activity markedly 
lower than the recombinant factors described in the invention, as shown in 
example 3. 

In the case of MSP, the interaction sites with the Ron receptor are 
less understood: some preliminary studies suggest a situation opposite of 
20 that of HGF, i.e. the p chain directly binds the receptor whereas the a 
chain would act stabilizing the complex (Wang MH et al., 1997, J. Biol. 
Chem. 272:16999-17004). 

The therapeutical use of molecules such as HGF and MSP is 
potentially valuable in a wide range of pathologies (Abdulla, S., 
25 1 99 7, Mol. Med. Today 3: 233). Nevertheless, a number of technical 
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as well as biological complications make the application of these 
molecules in clinics difficult. First of all, the pleiotropic character 
of these factors can causes poorly selective biological responses, 
which involve undesired side effects. For example, the use of HGF 
5 to prevent some side effects of the chemotherapeutic drug cisplatin 
has been proposed (Kawaida K et. al., 1994, Proc. Natl. Acad. Sci. 
91:4357-4361). Cancer patients treated with this drug can suffer 
kidney acute damage due to the cytotoxic action of cisplatin on 
proximal tubule epithelial cells. HGF is able to protect these cells 
10 against programmed death (apoptosis) induced by cisplatin, but at 
the same time it can induce an undesired proliferation of neoplastic 
cells. Other problems related to the pharmaceutical use of HGF and 
MSP are the necessity of their proteolytic activation and their 
stability, which causes technical problems. The NK1 and NK2 
15 truncated forms of HGF do not require proteolytic activation, but 
they have a reduced biological activity. 
Summary of the invention 

The present invention provides recombinant molecules 
composed of a combination of structural domains derived from the 
. 20 a chains of HGF and/or MSP, which overcome the problems of the 
prior art molecules described above. The molecules of this invention 
are composed of two superdomains connected by a linker. Each 
superdomain is composed of a combination of the HL and K1-K4 
domains of the a chain of HGF and/or MSP. These engineered 
25 factors induce selective biological responses, do not require 
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proteolytic activation, are stable and are more active than the truncated 
forms of HGF described previously. 
Detailed disclosure of the invention 

The present invention relates to recombinant proteins (which will be 
5 hereinafter referred to indifferently as proteins, molecules, engineered or 
recombinant factors) characterised by a structure that comprises two 
superdomains, each consisting of a combination of HL and K1-K4 
domains derived from the a chain of HGF and/or MSP, linked by a spacer 
sequence or a linker. In particular, the invention relates to proteins of 
10 general formula (I) 

[A] - B - [C] - (D) y (I) 

in which 

[A] corresponds to the sequence (LS) m -HL-Kl-(K2) n -(K3) 0 -(K4) p 
wherein (the numbering of the following amino acids refers to the HGF 
15 and MSP sequences as reported in Fig. 1 and 2, respectively): 

LS is an amino acid sequence corresponding to residues 1-31 of HGF or 1- 
18 of MSP; 

HL is an amino acid sequence derived from the a chain of HGF starting 
between residues 32-70 and ending between residues 96-127; or it is an 
. 20 amino acid sequence derived from the a chain of MSP starting between 
residues 19-56 and ending between residues 78-109; 

Kl is an amino acid sequence derived from the a chain of HGF starting 
between residues 97-128 and ending between residues 201-205; or it is an 
amino acid sequence derived from the a chain of MSP starting between 
25 residues 79-110 and ending between residues 186-190; 
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K2 is an amino acid sequence derived from the a chain of HGF starting 
between residues 202-206 and ending between residues 283-299; or it is 
an amino acid sequence derived from the a chain of MSP starting between 
residues 187-191 and ending between residues 268-282; 
5 K3 is an amino acid sequence derived from the a chain of HGF starting 
between residues 284-300 and ending between residues 378-385; or it is 

an amino acid sequence derived from the a chain of MSP starting between 

residues 269-283 and ending between residues 361-369; 

K4 is an amino acid sequence derived from the a chain of HGF starting 
10 between residues 379-386 and ending between residues 464-487; or it is 

an amino acid sequence derived from the a chain of MSP starting between 

residues 362-370 and ending between residues 448-481; 

m, n, o, p can be 0 or 1 ; 

the sum n + o + p is an integer from 1 to 3 or 0, with the proviso that 
15 n>o>p; 

B is the sequence [(X) q Y] r , wherein X = Gly and Y = Ser, or Cys, or Met, 
or Ala; 

q is an integer from 2 to 8; 
r is an integer from 1 to 9; 
20 [C] corresponds to the sequence HL-Kl-(K2) s -(K3) r (K4) u 
wherein HL, K1-K4 are as defined above, 

s, t, u are 0 or 1; the sum s + t + u is an integer from 1 to 3 or 0, with the 
proviso that s > t > u; 

D is the sequence W-Z, wherein W is a conventional proteolytic site, Z is 
25 any tag sequence useful for the purification and detection of the protein; y 
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is 0 or 1 . 

Non-limiting examples of W are consensus sequences for 
enterokinase protease, thrombin, factor Xa and IgA protease. 

Preferred proteins of general formula (I), are those in which: 
5 the HL domain is a sequence of HGF a chain ranging from amino 
acids 32 to 127,-or a sequence of MPS a chain ranging from amino acids 
19 to 98; the Kl domain is a sequence of HGF a chain ranging from 
amino acids 128 to 203, or a sequence of MPS a chain ranging from 
amino acids 99 to 188; the K2 v domain is a sequence of HGF a chain 
10 ranging from amino acids 204 to 294, or a sequence of MPS a chain 
ranging from amino acids 1 89 to 274; the K3 domain is a sequence of 
HGF a chain ranging from amino acids 286 to 383, or a sequence of MPS 
a chain ranging from amino acids 275 and 367; the K4 domain is a 
sequence of HGF a chain ranging from amino acids 384 to 487, or a 
15 sequence of MPS a chain ranging from amino acids 368 and 477. 

Among the possible combinations of the domains of general 
formula (I), the following (II) and (III) are preferred, concerning two 
recombinant factors named Metron Factor- 1 and Magic Factor- 1, 
respectively: 

. 20 LSmsp-HLmsp-K1msp-K2msp-L-HL H gf-K1hgf-K2hgf-D (Metron Factor- 1) 

(II) 
and 

LShgf-HLhgf-K1hgf-K2hgf-L-HLhgf-K1hgf-K2 HG f-D (Magic Factor- 1) 
(III) 

25 For both molecules, L is a linker sequence (Gly 4 Ser) 3 , D is a tag 
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sequence Asp 4 -Lys-His 6 . 

For Metron Factor- 1 , LSmsp is the sequence 1-18 of MSP, HL M sp is 
the sequence 19-56 of MSP, K1 M sp is the sequence 99-188 of MSP, K2 MSP 
is the sequence 189-274 of MSP, HL HGF is the sequence 32-127 of HGF, 
5 KIhgf is the sequence 128-203 of HGF, K2 HGF is the sequence 204-294 of 
HGF. 

For Magic Factor- 1, HL H gf> K1 H gf> K2 H gf are as defined above, 
LShgf is the sequence 1-31 of HGF. 

The hybrid molecules of the invention are prepared by genetic 
10 engineering techniques according to a strategy involving the following 
steps: 

a) construction of DNA encoding the desired protein; 

b) insertion of DNA in an expression vector; 

c) transformation of a host cell with recombinant DNA (rDNA); 

15 d) culture of the transformed host cell so as to express the recombinant 
protein; 

e) extraction and purification of the produced recombinant protein. 

The DNA sequences corresponding to HGF or MSP structural 
domains can be obtained by synthesis or starting from DNA encoding for 
. 20 the two natural factors. For example, screening of cDNA libraries can be 
carried out using suitable probes, so as to isolate HGF or MSP cDNA. 
Alternatively, HGF or MSP cDNA can be obtained by reverse 
transcription from purified mRNA from suitable cells. 

cDNAs coding for the fragments of HGF and MSP p chains can be 
25 amplificated by PCR (Mullis, K.B. and Faioona, F.A., 1987, Methods in 
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Enzymol. 155, 335-350), and the amplification products can be 
recombined making use of suitable restriction sites, naturally occurring in 
the factor sequences or artificially introduced in the oligonucleotide 
sequence used for the amplification. 
5 In greater detail, one of the above mentioned strategies can be the 

following: 

the portions of DNA encoding the LS, HL, Kl, K2, K3 and K4 domains 
are amplificated by PCR from HGF or MSP cDNA and then recombined to 
obtain the hybrid sequences corresponding to [A] and [C]. 
10 Oligonucleotides recognising sequences located at the two ends of the 
domains to be amplificated are used as primers. Primers are designed so as 
to contain a sequence allowing recombination between the DNA of a 
domain and the adjacent one. Said recombination can be carried out by 
endonuclease cleavage and subsequent ligase reaction, or making use of 
15 the recombinant PCR method (Innis, NA et al.., 1990, in PCR Protocols, 
Academic Press, 177-183). 

The sequence encoding the domain B (linker) can be obtained by 
synthesis of a double chain oligonucleotide, which can be inserted 
between [A] and [C] using suitable restriction sites. 
. 20 The resulting three fragments encoding for [A], [B] and [C] are then 

inserted in the correct sequence in a suitable vector. In this step it can be 
decided whether to add or not the domain D (tag), obtained by synthesis 
analogously to domain B, downstream fragment [C]. 

The recombinant expression vector can contain, in addition to the 
25 recombinant construct, a promoter, a ribosome binding site, an initiation 
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codon, a stop codon, optionally a consensus site for expression enhancers. 

The vector can also comprise a selection marker for isolating the 
host cells containing the DNA construct. Yeast or bacteria plasmids, such 
as plasmids suitable for Escherichia Coli, can be used as vectors, as well 
5 as bacteriophages, viruses, retroviruses, or DNA. 

The vectors are cloned preferably in bacterial cells, for example in 
Escherichia Coli, as described in Sambrook J., 1989, Molecular Cloning, 
Cold Spring Harbor Laboratory Press, New York, and the colonies can be 
selected, for example, by hybridisation with radiolabeled oligonucleotide 
10 probes; subsequently, the rDNA sequence extracted from the positive 
colonies is determined by known methods. 

The vector with the recombinant construct can be introduced in the 
host cell according to the competent cell method, the protoplast method, 
the calcium phosphate method, the DEAE-dextran method, the electric 
15 impulses method, the in vitro packaging method, the viral vector method, 
the micro-injection method, or other suitable techniques. 

Host cells can be prokaryotic or eukaryotic, such as bacteria, yeasts 
or mammal cells, and they will be such as to effectively produce the 
recombinant protein. 
20 After transformation, cells are grown in a suitable medium, which 

can be for example MEM, DMEM or RPMI 1640 in the case of mammal 
host cells. 
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The recombinant protein is secreted in the culture medium from 
which it can be recovered and purified with different methods, such as 
mass exclusion, absorption, affinity chromatography, salting-out, 
precipitation, dialysis, ultrafiltration. 
5 A simple, rapid system for the production of the molecules of the 

invention is, for example, transient expression in mammal cells. 

Accordingly, the plasmid containing the recombinant DNA 
fragment, for example PMT2 (Sambrook, J. et ai., 1989, Molecular 
Cloning, Cold Spring Harbor Laboratory Press), is transfected in suitable 
10 recipient cells, such as Cos7 (Sambrook, J. et ah, supra) by the calcium 
phosphate technique or other equivalent techniques. Some days after 
transfection, the conditioned medium of the transfected cells is collected, 
cleared by centrifugation and analysed for its content in factor. For this 
analysis, antibodies directed against HGF or MSP, or against any tag 
15 sequence, can be used: the supernatant is immunoprecipitated and then 
analysed by western blot with the same antibody. The supernatant 
containing the recombinant factor can also be used directly for 
biochemical and biological tests. The protein can be purified, for example, 
using a poly-histidine tag sequence, by absorption on a nickel resin 
20 column and subsequent elution with imidazole. 

The biochemical properties, of the recombinant factors of the 
invention were tested in connection with their ability to activate Met and 
Ron receptors. 

Sub-micromolar concentrations of the factors have proved to induce 
25 phosphorylation in Met tyrosine in human epithelial cells A549, whereas 
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they do not induce phosphorylation above basal values in cells expressing 
Ron. On the whole, the tests proved that the first two kringles of HGF 
maintain their ability to interact and to activate Met tyrosine kinase 
receptor, whereas the corresponding first two kringles of MSP are not 
5 sufficient for modulating the catalytic activity of the Ron receptor. 
However, the interaction with Ron, although at low affinity, can contribute 
to the recruitment of the factor at the cell surface, playing a similar role to 
low affinity receptors (of mature glycoprotein) which recruit the HGF 
intact molecule through the heparin-binding domain. 
10 The molecules of the invention have a marked biological activity, 

measured by the scattering tests, and a protecting activity against cell 
apoptosis induced by cisplatin or etoposide. 

In particular, the supernatant containing the recombinant factor has 
been found to promote scattering of epithelial cells of various nature even 
15 at nanomolar concentrations. In these tests, kidney epithelial cells 
(MDCK) or hepatocyte precursors (MLP29) were used. 

In an in vitro experimental system, in which DNA fragmentation 
typical of apoptotic cells is evaluated by the TUNEL method (Gavrieli, Y. 
et al., 1992, J. Cell. Biol. 117, 493-501), the recombinant factors protect 
20 against apoptosis induced by chemotherapeutic drugs at levels comparable 
with HGF and remarkably higher than MSP. The engineered molecules 
proved to be active on human primary epithelial cells from proximal 
tubule (PTECs), on an immortalised PTECs line (Loc) and on the already 
cited murine hepatocytes MLP29. 
25 Among the applications of the recombinant molecules of the 
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invention, the following can be cited: 

prevention of myelotoxicity; in particular they can be used for the 
expansion of marrow precursors, to increase proliferation of the 
hematopoietic precursors or to stimulate their entry in circle; 
5 - prevention of liver and kidney toxicity, and of mucositis following 

> 

antineoplastic treatments; in particular the' recombinant factors can 
be used to prevent toxicity (apoptosis) on differentiated cell 
elements of liver, kidney and mucosa of the gastroenteral tract, and 
to stimulate staminal elements of cutis and mucosas to allow the 
10 regeneration of germinative layers; 

prevention of chemotherapeutic neurotoxicity. 

In general, the proteins of the invention provide the following 
advantages, compared with the parent molecules HGF and MSP: 
they are smaller molecules with a more compact structure; 
15 - they are more stable and are produced in higher amounts; 

they require no endoproteolytic cleavage for activation, which 
transforms the HGF and MSP precursors into the respective active 
forms; 

they can be engineered in combinations of different functional 
. 20 domains, thereby modulating the biological effects, increasing the 

favourable ones and reducing those undesired (for example, 
protection from apoptosis versus cell proliferation). 
The invention has to be considered also directed at amino acid and 
nucleotide sequences referred to formula (I), having modifications which 
25 can, for example, derive from degeneration of genetic code, without 



WO 99/38967 PCT/EP99/00473 

15 

therefore modifying the amino acid sequence, or from the deletion, 
substitution, insertion, inversion or addition of nucleotides and/or bases 
according to all the possible methods known in the art. 

Furthermore, the invention relates to the expression vectors 
5 comprising a sequence encoding for a protein of general formula (I), 
which can be plasmids, bacteriophages, viruses, • retroviruses, or others, 
and to host cells containing said expression vectors. 

Finally, the invention relates to the use of the recombinant proteins 
as therapeutical agents, and to pharmaceutical compositions containing an 
10 effective amount of the recombinant proteins together with 
pharmacologically acceptable excipients. 

Description of the Figures 

(In the following legends, -His located after the name of the parent 
factors, truncated or recombinant, or of the plasmids, means that the 
15 respective sequences contain a poly-histidine tag). 
Figure 1: 

a) Nucleotide and amino acid sequence of human HGF (Gene 
Bank # M73239; Weidner, K.M., et al., 1991, Proc. Acad. Sci. USA, 
88:7001-7005). In contrast to the cited reference, in the numbering used 

. 20 herein, nucleotide No. 1 is the first base of the initiation codon (the A of 
the first ATG). The first amino acid is the corresponding methionine. The 
cDNA untranslated regions at 5' and 3' are neither represented nor 
considered in the numbering. 

b) Nucleotide and amino acid sequence of human MSP (Gene 
25 Bank # LI 1924; Yoshimura, T., et al., 1993, J. Biol. Chem., 268:15461- 
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15468). In contrast to the cited reference, in the numbering used herein 
nucleotide No. 1 is the first base of the initiation codon (the A of the first 
ATG). The first amino acid is the corresponding methionine. The cDNA 
untranslated regions at 5' and 3' are neither represented nor considered in 
5 the numbering. 
Figure 2: 

a) Molecular structure of Metron Factor- 1. The leader sequence 
is removed from the cells used for the production before secretion and is 
therefore absent in the mature molecule. The poly-histidine tag can be 

10 removed by digestion with the protease enterokinase. 

b) Nucleotide and amino acid sequence of Metron-F actor- 1 . The 
nucleotide sequence starts with the EcoRI site and terminates with the Sail 
site (first six bases and last six bases, respectively). The initiation codon 
(ATG) and the stop codon (TAG) are underlined. 

15 Figure 3: 

a) Molecular structure of Magic Factor- 1. The leader sequence 
is removed from the cells used for the production before secretion and is 
therefore absent in the mature molecule. Poly-histidine tag can be removed 
by digestion with the protease enterokinase. 
20 b) Nucleotide and amino acid sequence of Magic Factor- 1. The 

nucleotide sequence starts with the Sail site (first six bases and last six 
bases, respectively). The initiation codon (ATG) and the stop codon 
(TAG) are underlined. 
Figure 4: 

25 Production of Metron-F- 1 by transient transfection of mammal 
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cells. The conditioned supernatants from BOSC cells transfected with the 
control plasmid (CTRL) or with pRK7-Metron F-l-His were 
immunoprecipitated with an anti-MSP polyclonal antibody and detected 
by western blot with the same antibody. 
5 Figure 5: 

Quantitation of the recombinant proteins by western blot. (A) The 
proteins were absorbed on Sepharose-A-heparin beads and detected with 
an anti-poly-histidine monoclonal antibody. (B) The proteins were 
immunoprecipitated with an anti-MSP polyclonal antibody and detected 
10 with an anti-poly-histidine monoclonal antibody. 
Figure 6: 

Scattering test carried out on kidney epithelial cells (MDCK) using 
the recombinant proteins prepared by transient transfection. The protein 
content was quantified by western blot (see Fig. 5). (A) non-stimulated 
15 cells; (B) cells stimulated with control supernatant; (C) cells stimulated 
with HGF-His; (D) cells stimulated with NK2-HGF-His; (E) cells 
stimulated with Metron Factor- 1; (F) cells stimulated with Magic Factor- 1. 
Figure 7: 

Activation (phosphorylation) of Met receptor by the hybrid factor 
20 Metron Factor- 1. Human epithelial cells (A549) were stimulated with 
supernatants conditioned from BOSC cells transfected with the control 
plasmid (CTRL) or with pRK7-Metron-F-l-His (METRON F-i) at the 
indicated dilutions. Cell lysates from the stimulated cells were 
immunoprecipitated with an anti-Met monoclonal antibody and detected 
25 by western blot with an anti-phosphotyrosine monoclonal antibody. 
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Figure 8: 

Protective effect of Metron-F-1 against acute renal failure induced 
by HgCl 2 in vivo. Balb-c mice were injected i.v. with Metron-F-1 or 
vehicle at 0.5 h before and 6, 12, 24, 36 and 48 h after HgCl 2 i.v. 
5 administration. BUN and histological evaluation of renal necrosis were 
measured at 72 h. 

Data expressed as mean + e.s. of 7 animals/group (BUN) or 3 
animals/group (histology). 

The following examples illustrate in greater detail the invention. 
10 Example la: Preparation of the recombinant construct encoding 
Metron Factor-1 

HGF cDNA was obtained by the RT-PCR technique (Reverse 
Transcriptase PCR; in: Innis, M. A., et al.., 1990, PCR Protocols, 
Academic Press, 21-27) from a human iung fibroblast cell line (MRC5; 
15 Naldini, L. et al., 1991, EMBO J. 10: 2867-2878). MSP cDNA was 
obtained with the same technique from human liver (Gaudino, G., et al., 
1994, EMBO J. 13: 3524-3532). 

The fragment corresponding to MSP LS-HL-K1-K2 was amplified 
by PCR using MSP cDNA as template and the following oligonucleotides 
20 as primers: 
PI (sense) 

5' CGCGCG GAATTC CACCATGGGGTGGCTCCCACTCCT 3' 
P2 (antisense) 

5' CGCGCG CTCGAG GCGGGGCTGTGCCTCGGACCCGCA 3' 
25 in which the underlined palindromic sequences are the restriction sites for 
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the enzymes EcoRI (oligonucleotide PI) and Xhol (oligonucleotide P2). 
The PCR product was digested with the restriction enzymes EcoRI and 
Xhol and then purified by electrophoresis on agarose gel. 

The fragment corresponding to HL-K1-K2 of HGF was amplified 
5 by PCR using HGF cDNA as template and the following oligonucleotides 
as primers: 
P3 (sense) 

5* CGCGCG TCTAGA GGGACAAAGGAAAAGAAGAAATAC 3' 
P4 (antisense) 

io 5' CGCGCGAAGCTTTGTCAGCGCATGTTTTAATTGCAC 3' 

in which the underlined palindromic sequences are the restriction sites for 
the enzymes Xbal (oligonucleotide P3) and Hindlll (oligonucleotide P4). 
The PCR product was digested with the restriction enzymes Xbal and 
Hindlll and then purified by electrophoresis on agarose gel. 

15 For the linker sequence, the following partially complementary 

oligonucleotides were synthesised, and were subsequently annealed to 
obtain a double strand DNA fragment with sticky ends: 
P5 (sense) 

5 1 TCGAGGGCCCTGGCGGTTCTGGTGGCGGTGGCTCCGGC^rGCCGGTrCT 3' 

20 P6 (antisense) 

5 1 CTAG AGAACCGCCACCGCXGGAGCCACCGGCACGAGAACCGQ3ACCGCCC 3' 

in which the underlined bases are the sequences compatible with the 
restriction sites for the enzymes Xhol (oligonucleotide P5) and Xbal 
^ (oligonucleotide P6). 
25 The resulting three DNA fragments were subcloned in the EcoRI- 
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Hindlll sites of the expression vector pRK7 (Gaudino, G., et al., 1994, 
EMBO J. 13: 3524-3532), to obtain the recombinant plasmid pRK7- 
Metron-F-1, containing all the components of Metron Factor- 1 except the 
tag sequence. 

5 For the insertion of the tag sequence, the following partially 

complementary oligonucleotides were syrfthesised, and were subsequently 
annealed to obtain a double strand DNA fragment with sticky ends: 
P7 (sense) 

^AGCTGACGACGACGACAAACACCACCACCACCACCACXACTAGG GTCGAC S^ 

10 P8 (antisense) 

5' AGCTGTCGAC CCTAGTGGTGGTGGTGGTGfn^GGTGTTTGTrrTTCGTr.GTC 3" 
in which the underlined bases are compatible with the Hindlll restriction 
site and the boxed palindromic sequences are the consensus sequences for 
the enzyme Sail. The resulting double strand DNA fragment was inserted 

15 in the restriction site Hindlll of the recombinant plasmid obtained at the 
previous step (destroying the Hindlll site and creating the Sail site), to 
obtain the plasmid pRK7-Metron-F-l-His. 
Example lb: Production of Metron Factor-1 

The expression vector pRK7 contains a promoter of human 
. 20 cytomegalovirus immediate-early gene (CMV) and an episomal replication 
origin site of the DNA virus SV40. Therefore, this plasmid is particularly 
suitable for the expression of proteins in cells expressing the large T 
antigen of the virus SV40, such as kidney epithelial BOSC cells 
(Sambrook, J. et al.., 1989, Molecular Cloning, Cold Spring Harbor 

25 Laboratory Press). Metron Factor-1 can then be produced by transient 
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transfection of plasmid pRK7-Metron F-l-His in BOSC ceils. 

For transfection, 10 6 cells are seeded at day 0 in a 100 mm plate in 

90% Dulbecco's Modified Eagle Medium (DMEM)-10% bovine calf serum 

(10 ml/plate). At day 1, cells are transfected with 10 pg/plate of pRK7- 

5 Metron-F-l~His by lipofection, using the protocol provided by the 

> 

lipofectin producer (Gibco-BRL). At day 2, the DNA-containing medium 
is substituted by fresh medium with low content in serum (99.5% DMEM- 
0.5% bovine calf serum). At day 4 (48 hours after the end of the 
transfection), the medium is collected, cleared by centrifugation, and 
10 analysed for its content in Metron Factor- 1. 

This analysis can be carried out in different ways. For example, the 
recombinant protein present in the cleared supernatant can be 
immunoprecipitated with an anti-MSP antibody and then detected by 
western blot with the same antibody (Fig. 4). In the example shown in 
15 figure 4, 500 til of supernatant (cleared by centrifugation, buffered in 25 
mM HEPES and added with a protease inhibitors cocktail) were 
immunoprecipitated (2 hours at 4° C) with 20 p.1 of Sepharose-A beads 
(Pharmacia) covalently conjugated with 2 pi of anti-MSP polyclonal 
antibody. The beads pellet was washed 3 times with 500 pi of washing 
20 buffer (20 mM HEPES pH 7.4; 150 mM NaCl; 0.1% Triton X-100; 10% 
glycerol) and heated at 90° C for 2 minutes in 100 p.1 of Laemmli buffer. 
Eluted proteins were separated by SDS-PAGE on 8% BIS-acrylamide gel, 
transferred onto membrane (Hybond-C; Amersham) and analysed by 
western blot. For this analysis, the same rabbit serum used for 
25 immunoprecipitation was employed as primary antibody with a 1:1000 
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dilution and protein A conjugated with peroxidase (Amersham) was used 

as secondary antibody. Protein A was detected by ECL (Amersham) 

following the protocol provided by the producer. 

Alternatively, the recombinant protein can be partially purified by 

5 adsorption on Sepharose-A beads conjugated with heparin and subsequent 

> 

analysis by western blot using antibodies directed to poly-histidine tag 
(Fig- 5). 

In the example shown in figure 5, the Sepharose-A-heparin beads 
(20 u.1; Pierce) were incubated (4 hours at 4° C) with 500 u.1 of supernatant 

10 (cleared by centrifugation, buffered in 25 mM HEPES and added with a 
protease inhibitors cocktail) in the presence of 500 mM NaCl, washed 
with suitable buffer (500 mM NaCl; 20 mM HEPES pH 7.4; 0.1% Triton 
X-100; 10% glycerol) and heated at 90° C for 2 minutes in 100 fil 
Laemmli buffer. Eluted proteins were separated by SDS-PAGE on 8% bis- 

15 acrylamide gel, transferred onto membrane (Hybond-C; Amersham) and 
analysed by western blot. For this analysis, a mouse monoclonal antibody 
to poly-histidine (Invitrogen) diluted 1:5000 was used as primary antibody 
and an anti-mouse IgG ovine antibody conjugated with peroxidase 
(Amersham) was used as secondary antibody. The secondary antibody was 

20 detected by ECL (Amersham) following the protocol provided by the 
producer. 

The procedure of adsorption on heparin beads can also be used as 
protocol for the semi-purification of the recombinant protein. Furthermore, 
the molecule can additionally be purified making use of the poly-histidine 
25 affinity to heavy metals such as nickel. The protein containing poly- 
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histidine tag can be adsorbed on a nickel resin column (Invitrogen) and 
subsequently eluted with imidazole (the detailed protocol is provided by 
the manufacturer). 

Example lc: METRON-F-1 production in insect cells 

5 The cDNA encoding for Metron-Fl was subcloned in a suitable 

expression vector (p-FASTBAC) to generate a recombinant plasmid 
containing the Metron-Fl gene (p-FASTBAC-Metron). A competent E. 
Coli strain (DH10 Bac) was transformed with p-FASTBAC-Metron to 
generate BACMID DNA. The DNA of positive colonies was isolated and 
10 checked by PCR to show the correct integration of the expression vector. 
Subsequently, the DNA from three clones was transfected into Sf9 insect 
cells with CellFECTIN reagent to produce virus particles. Virus titer was 
tested by a plaque assay. Single plaques were isolated and used for further 
propagation of the baculovirus. Viral stock was subsequently expanded in 
15 insect cells to scale up METRON-F-1 production. To verify protein 
expression, insect cells were infected with a multiplicity of infection 
(MOI) of 1 in a small-scale reactor. Samples of supernatants were 
analysed by SDS-PAGE followed by western blotting. 

To produce amounts adequate for in vivo testing, insect cells were 
20 propagated in a 2.5-Liter stirred tank bioreactor. Cells were grown to a 
cell density of 1.106 ml" 1 before they were infected with a MOI of 1. Cell 
suspension was harvested 3 days post infection. The supernatant 
containing the recombinant protein was separated by centrifugation. The 
presence of Metron F-l in the supernatant was proved by SDS-PAGE 
25 followed by western blotting. Metron F-l was pre-purified by a dual step 
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affinity chromatography on heparin sepharose (heparin-Hi Trap, 
Pharmacia) at 6° C. For in vivo testing or for further purification steps, the 
eluted fractions containing Metron F-l were desalted by Sephadex G-25 
chromatography (PD-10 or HiPrep 26/10, Pharmacia). Metron F-l was 
5 further purified by chromatography on HisTrap columns (Pharmacia) and 
eluted by an imidazole gradient (0-0.5" M) using either a low-pressure 
system (Econo System, BIO-RAD) or an FPLC system (Pharmacia). 
Metron F-l was eluted at an imidazole concentration of about 0.15 M. For 
in vivo testing, the eluted fractions containing Metron F-l were freed of 
10 imidazole by Sephadex G-25 chromatography as already described, using 
the buffer to be used for animal treatment. 

Example 2a : Preparation of the recombinant construct encoding for 
Magic Factor-1 

HGF cDNA and the plasmid pRK7-Metron-F-l-His described above 
15 were used as starting DNA. The fragment corresponding to LS-HL-K1-K2 
of HGF was amplificated by PCR using HGF cDNA as template and the 
following oligonucleotides as primers: 
P9 (sense) 

5' CGCGCGGGATCCGCCAGCCCGTCCAGCAGCACCATG 3' 
20 P10 (antisense) 

5' CGCGCGAAGCTTTGTCAGCGCATGTTTTAATTGCAC 3' 
in which the underlined palindromic sequences are the restriction sites for 
the enzymes BamHI (oligonucleotide P9) and HindHI (oligonucleotide 
P10). The PCR product was digested with the restriction enzymes BamHI 
25 and HindHI and then purified by electrophoresis on agarose gel. 
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For the linker, the following partially complementary 
oligonucleotides were synthesized, and subsequently annealed to obtain a 
double strand DNA fragment with sticky ends: 
PI 1 (sense) 
5 5* AGCIT QXKKXjCro 
P12 (antisense) 
gCTAGAGAACXXjG^ 

in which the underlined bases are the sequences compatible with the 
restriction sites for the enzymes Hindlll (oligonucleotide PI 1) and Xbal 

10 (oligonucleotide PI 2). The fragment resulting by PCR and the double 
strand linker sequence were inserted in the plasmid pRK7-Metron-F-l-His 
in place of the fragment EcoRI-Xbal by means of an EcoRI-BamHI 
adapter, to obtain the plasmid pRK7-Magic-F-l-His. 
Example 2b: Production of Magic Factor-1 

15 Magic Factor-1 is produced on a small scale by transient 

transfection of BOSC cells analogously to what described for Metron 
Factor-1. Semi-purification is performed by adsorption on Sepharose-A 
beads conjugated with heparin followed by Western blot analysis using 
anti-poly-histidine antibodies (Fig. 5). 

20 Example 3 : Biological activity (scattering) on epithelial cells. 

The biological activity of recombinant HGF, NK2-HGF, Metron 
Factor-1 and Magic Factor-1 was tested by a "scatter" assay on MDCK 
epithelial cells. For this functional test, cells are plated at day 0 in 96-well 
plates (10 3 cells/well) in 90% DMEM - 10% bovine calf serum. At day 1 

25 the medium is substituted with fresh medium buffered with 50 mM 
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HEPES pH 7.4 and the supernatant containing the recombinant protein is 
added at different dilutions. At day 2 cells are washed with DPBS 
(Dulbecco's Phosphate Buffered Saline), fixed in 11% glutaraldehyde, 
stained with a Crystal-Violet solution and analysed by microscopy. The 
scattering activity is evaluated observing the morphology of the colonies, 
which are clustered in the negative control- (non-stimulated cells or 
stimulated with supernatant containing no factors) whereas they are 
dispersed in the positive control (HGF-His). The morphology of the cells 
themselves also varies upon stimulation: in fact, as it can be observed in 
Fig. 6, cells stimulated with HGF-His and Metron Factor- 1 have a more 
oblong, spindle-shaped form, characterised by protrusions of the cell 
membrane called pseudopodes. These morphological variations are the 
consequence of factor-induced activation of a genetic program involving 
the modification of a series of cellular parameters, such as digestion of 
cell matrix by specific proteases and increase in motility. 

The Table summarises the results of different tests, obtained with 
factors HGF, NK2-HGF, Metron Factor- 1 and Magic Factor- 1 on MDCK 
cells. The scattering units reported indicate the maximum dilution of the 
conditioned supernatant containing the factor, at which motogenic activity 
could be observed. Values are normalised for the protein content 
determined by western blotting as described above (see Fig. 5). These data 
indicate that the hybrid factors Metron Factor- 1 and Magic Factor- 1 have a 
scattering activity approximately three magnitudes higher than that of the 
NK2-HGF-His truncated form and one magnitude higher than that of 
HGF-His parental factor. 
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HGF-his 


NK2-his 


Metron F-l 


Magic Factor- 1 


Scatter units 


900 ± 29 


6±5 


5500± 1532 


7600 ± 150 



Table. Scattering activity of factors HGF-His, NK2-HGF-His 



Metron Factor-1 measured on kidney t epithelial cells (MDCK). The 
scattering units reported indicate the maximum dilution of the conditioned 
5 supernatant containing the factor, at which a motogenic activity can be 
observed. Values are normalised for the protein content determined by 
western blotting. 

Example 4a : Test for the evaluation of protection against programmed 
cell death (apoptosis). 

10 One of the most characterised side effect of the chemotherapeutic 

drug cisplatin is the induction of programmed cell death (apoptosis) of 
epithelial cells of the proximal tubule, which leads to acute renal failure 
(ARF). Thus, a factor that protects against cisplatin-induced cytotoxicity 
is highly desirable. An in vitro functional test has been used, which allows 
15 to evaluate the percentage of cisplatin- treated apoptotic cells in the 
presence or in the absence of a survival factor. This system utilises a cell 
line (LOC) derived from epithelial cells of human kidney proximal tubule, 
immortalised by ectopic expression of SV40 large T antigen. For the 
functional test, ceils are plated at day 0 in 96-well plates (10 J cells/well) in 
20 9 0% DMEM - 10% bovine calf serum. At day 1, the medium is substituted 
with medium containing 0.5% bovine calf serum buffered with 50 mM 
HEPES pH 7.4, which is added with different dilutions of the supernatant 
containing the recombinant factor. Cells are pre-incubated with these 
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factors for 6 hours, and then further incubated in the presence of 10 pg/ml 
cisplatin. At day 2, cells are washed with DPBS and the percentage of 
apoptotic cells is evaluated by the TUNEL technique (Boehringer 
Mannheim). The same kind of tests can be performed using primary 
5 cultures of human epithelial cells of kidney proximal tubule (PTEC). 
These tests proved that Metron Factor"- 1 and Magic Factor- 1 have 
protecting activity against cisplatin-induced programmed cell death. 
Example 4b: Protection against cisplatin-induced cytotoxicity by 
transient gene delivery of Metron Factor-1 and Magic Factor-1 
10 The protective effect of Metron F-l and Magic F-l against 

cisplatin-induced cytotoxicity was further demonstrated by a transient 
gene delivery approach. Simian kidney epithelial cells (COS) were 
transfected with a control empty vector, an expression vector for Metron 
F-l, or an expression vector for Magic F-l. Following transfection, cells 
15 were treated for 16 hours with cisplatin (20 jig/ml) and the percentage of 
surviving cells in each transfection was determined. Cisplatin treatment 
was calibrated to cause the death of approximately 20% of the cells in the 
negative control. Ectopic expression of Metron F-l or Magic F-l increased 
the survival rate to about 92.3% and 94.0%, respectively. 
20 Example 5 : Activation of the Met receptor by Metron Factor-1 and 
Magic Factor-1 

The ability of Metron Factor-1 and Magic Factor-1 to activate the 
Met receptor was tested by analysing the ability of the recombinant factors 
to induce tyrosine phosphorylation of Met in human epithelial cells 
25 (A549). For this analysis, A549 cells at 90% confluence in a 100 mm petri 
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dish were stimulated for 10 minutes with 1 ml of conditioned supernatant 
containing Metron Factor- 1, Magic Factor- 1 or no factor (as negative 
control) diluted 1:2.5 or 1:10 in DMEM. After stimulation, cells were 
washed in ice with PBS, lysated in 200 fj.1 of lysis solution (1% Triton X- 
5 100, 5 mM EDTA, 150 mM NaCl, 20 mM Tris-HCl pH 7.4), added with a 
cocktail of protease inhibitors, immunoprecipitated for 2 hours at 4° C 
with 10 ill of Sepharose-A beads covalently conjugated with an anti-Met 
monoclonal antibody (Naldini, L. et al., 1991, EMBO J. 10: 2867-2878), 
washed 3 times in the same lysis solution, and heated at 90°C for 2 
10 minutes to elute the absorbed proteins. These were separated by SDS- 
PAGE on a 8% BIS-acrylamide gel, transferred onto a membrane 
(Hybond-C; Amersham) and analysed by western blot. A mouse 
monoclonal antibody against phosphotyrosine (UBI) diluted 1:10000 was 
used as primary antibody and an anti-mouse IgG ovine antibody 
15 conjugated with peroxidase (Amersham) was used as secondary antibody. 
The secondary antibody was detected by ECL (Amersham) following the 
protocol provided by the manufacturer. This analysis revealed that Metron 
F-l and Magic F-l potently activate the Met receptor (Fig. 7). 
Example 6 : Protection against chemotherapy-induced renal failure by 
20 Metron Factor-1 in vivo 

Metron-F-1 was tested in a model of nephrotoxicity in Balb-c mice. 
The method used was substantially as described (Kawaida K et al., 1994, 
Hepatocyte growth factor prevents acute renal failure and accelerates renal 
regeneration in mice, Proc. Natl. Acad. Sci. 91:4357-4361). Briefly, renal 
25 failure was induced in male Balb-c mice weighing 20-25 g by an i.v. 
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injection of 7.5 mg/kg of HgCl 2 (7 animals/group). Renal damage was 
assessed by analysis of Blood Urea Nitrogen (BUN) and by histological 
evaluation, 72 h after HgCl 2 injection. Metron-F-1 was dissolved in 0.2 M 
NaCl, containing 0.01% Tween 80 and 0.25% human serum albumin and 
5 administered i.v. (100 u.g/kg in a posological volume of 6.6 ml/kg) 0.5 h 
before and 6, 12, 24, 36 and 48 h after Hg"Cl 2 injection. Controls animals 
were treated with the same amount of vehicle according to the same 
scheme. 

Metron-F-1 significantly prevented the onset of acute renal failure 
io induced by HgCl 2 , evaluated in terms of BUN (figure 8). BUN values 
were closely paralleled by the histological findings, scored by an 
independent investigator. 

In the following sequence listing: 
SEQ. ID. NO. 1: Magic F-l DNA coding sequence; 
15 SEQ. ID. NO. 2: Magic F-l amino acid sequence; 
SEQ. ID. NO. 3: Metron F-l DNA coding sequence; 
SEQ. ID. NO. 4: Metron F-l amino acid sequence. 
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CLAIMS 



1 . Recombinant proteins comprising two superdomains, separated by a 
spacer sequence (linker), obtained combining the HL and K1-K4 domains 
5 of HGF and/or MSP a chains, according to general formula (I): 

[A] - B - [C] - (D)y (I) 

in which 

[A] corresponds to the sequence (LS) TO -HL-Kl-(K2) Tl -(K3) 0 -(K4) p 
wherein (the numbering of the following amino acids refers to the HGF and 
10 MSP sequences as reported in Fig. 1 and 2, respectively): 

LS is an amino acid sequence corresponding to residues 1-31 of HGF or 1- 
18 of MSP; 

HL is an amino acid sequence starting between residues 32-70 of HGF a 
chain and ending between residues 96-127 of the identical chain; or it is an 
15 amino acid sequence starting between residues 19-56 of MSP a chain and 

ending between residues 78-109 of the identical chain; 
Kl is an amino acid sequence starting between residues 97-128 of HGF oc 
chain and ending between residues 201-205 of the identical chain; or it is 
an amino acid sequence starting between residues 79-1 10 of MSP a chain 
20 and ending between residues 186-190 of the identical chain; 

K2 is an amino acid sequence starting between residues 202-206 of HGF a 
chain and ending between residues 283-299 of the identical chain; or it is 
an amino acid sequence starting between residues 187-191 of MSP a chain 
and ending between residues 268-282 of the identical chain; 
25 K3 is an amino acid sequence starting between residues 284-300 of HGF a 
chain and ending between residues 378-385 of the identical chain; or it is 
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an amino acid sequence starting between residues 269-283 of MSP a chain 
and ending between residues 361-369 of the identical chain; 
K4 is an amino acid sequence starting between residues 379-386 of HGF a 
chain and ending between residues 464-487 of the identical chain; or it is 
5 an amino acid sequence starting between residues 362-370 of MSP a chain 
and ending between residues 448-481 of the identical chain; 
m, n, o, p are 0 or 1; 

the sum n + o + p is an integer from 1 to 3 6t 0, with the proviso that n 
> o>p; 

■10 B is the sequence [(X) q Y] T , wherein X = Gly and Y = Ser„ or Cys, or Met, 
or Ala; 

q is an integer from 2 to 8; 
r is an integer from 1 to 9; 

[C] corresponds to the sequence HL-Kl-(K2) s -(K3) t -(K4) n 
15 wherein HL, K1-K4 are as defined above, 

s, t, u are 0 or 1; the sum s + t + u is an integer from 1 to 3 or 0, with the 
proviso that s > t > u; 

D is the sequence W-Z, wherein Wis a conventional proteolytic site, Z is 
any tag sequence useful for the purifi cation and detection of the protein; y 
20 is 0 or 1. 

2. Recombinant proteins according to claims 1-2,— in which the HL 
domain is a sequence of HGF a chain ranging from amino acids 32 to 127, 
or a sequence of MPS a chain ranging from amino acids 19 to 98; the K.1 
domain is a sequence of HGF a chain ranging from amino acids 128 to 203, 
25 or a sequence of MPS a chain ranging from amino acids 99 to 188; the K2 
domain is a sequence of HGF a chain ranging from amino acids 204 to 294, 
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ox a sequence of MPS a chain ranging from amino acids 189 to 274; the K3 
domain is a sequence of HGF a chain ranging from amino acids 286 to 383, 
or a sequence of MPS a chain ranging from amino acids 275 to 367; the KL4 
domain is a sequence of HGF a chain ranging from amino acids 3 84 to 487, 
5 or a sequence of MPS a chain ranging from amino acids 368 to 477. 

3. Recombinant proteins according to claims 1-2 of formula (II): 
LSmsp-HLmsf-K-1 MS p-K2msp-L-HLhgf-K1hgf-K.2hgf-I5 (II) 

in which LSmsp is the sequence 1-18 of MSP, HLmsp is the sequence 19-56 
of MSP, KImsj, is the sequence 99-188 of MSP, K2 M sp is the sequence 189- 
10 274 of MSP 3 HLhgf is the sequence 32-127 of HGF, K1 H gf is the sequence 
128-203 of HGF. K2 H gf is the sequence 204-294 of HGF, L is the sequence 
(Gly 4 Ser) 3} D is the sequence Asp^-Lys-His^. 

4. Recombinant proteins according to claims 1-2 of formula (III): 
LSHGF-HL^Gp-Kl HO f-K2HGF-L-HLHOF-Kl H GF-K2HG?-D (HI) 

15 in which HL h <sp, KI^gf* K2 HGF:! L and D are as defined in claim 4, LSjtgf is 

: the sequence 1-31 of HGF. 

5. Nucleotide sequences encoding for the recombinant proteins of claims 
1-5. 

6. Expression vectors comprising the nucleotide sequences of claim 5. 
20 7. Prokaryotic or eukaryotic host cell transformed with the expression 

vector of claim 6. — 

8. Process for preparing the recombinant proteins of claims 1-4, which 

comprises the following steps: 

a) construction of DMA encoding the desired protein; 
25 b) insertion of DNA in an expression vector; 

c) transformation of a host cell with recombinant DNA (rDNA); 
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d) culture of the transformed host cell so as to express the recombinant 
protein; 

e) extraction and purification of the produced recombinant protein. 

9. Process according to claim 8, wherein the host cell is kidney epithelial 
5 30SC cell or SF9 insect cell, 

10. Recombinant proteins of claims 1-4 for use as therapeutic agents. 

11. Use of recombinant proteins of claims 1-4 in the manufacture of a 
medicament for the prevention or treatment of chemotherapeutic-induced 
toxicity. 

10 12. Use according to claim 9, wherein the chemotherapeutic-induced 
toxicity is myelotoxicity, kidney toxicity, neurotoxicity, mucotoxicity and 

'% hepatotoxicity. 

13. Pharmaceutical compositions containing an effective amount of the 
recombinant proteins of claims 1-4, in combination with pharmacologically 

15 acceptable excipients. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: DOMPE 1 S.p.A. 

(B) STREET: Via Canpo di Pile 

(C) CITY: L'AQUILA 

(E) COUNTRY: ITALY 

(F) POSTAL CODE (ZIP): 67100 

(ii) TITLE OF INVENTION: RECOMBINANT PROTEINS DERIVED FROM HGF AND MSP 
(iii) NUMBER OF SEQUENCES: 4 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE : Patentln Release #1.0, Version #1.30 (EPO) 
(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1725 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

ATGTGGGTGA CCAAACTCCT GCCAGCCCTG CTGCTGCAGC ATGTCCTCCT GCATCTCCTC 60 

CTGCTCCCCA TCGCCATCCC CTATGCAGAG GGACAAAGGA AAAGAAGAAA TACAATTCAT 120 

GAATTCAAAA AATCAGCAAA GACTACCCTA ATCAAAATAG ATCCAGCACT GAAGATAAAA 180 

ACCAAAAAAG TGAATACTGC AGACCAATGT GCTAATAGAT GTACTAGGAA TAAAGGACTT 240 

CCATTCACTT GCAAGGCTTT TGTTTTTGAT AAAGCAAGAA AACAATGCCT CTGGTTCCCC 300 

TTCAATAGCA TGTCAAGTGG AGTGAAAAAA GAATTTGGCC ATGAATTTGA CCTCTATGAA 360 

AACAAAGACT ACATTAGAAA CTGCATCATT GGTAAAGGAC GCAGCTACAA GGGAACAGTA 420 

TCTATCACTA AGAGTGGCAT CAAATGTCAG CCCTGGAGTT CCATGATACC ACACGAACAC 480 

AGCTATCGGG GTAAAGACCT ACAGGAAAAC TACTGTCGAA ATCCTCGAGG GGAAGAAGGG 540 

GGACCCTGGT GTTTCACAAG CAATCCAGAG GTACGCTACG AAGTCTGTGA CATTCCTCAG 600 
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TGTTCAGAAG TTGAATGCAT GACCTGCAAT GGGGAGAGTT ATCGAGGTCT CATGGATCAT 660 
ACAGAATCAG GCAAGATTTG TCAGCGCTGG GATCATCAGA CACCACACCG GCACAAATTC 720 
TTGCCTGAAA GATATCCCGA CAAGGGCTTT GATGATAATT ATTGCCGCAA TCCCGATGGC 780 
CAGCCGAGGC CATGGTGCTA TACTCTTGAC CCTCACACCC GCTGGGAGTA CTGTGCAATT 840 
AAAACATGCG CTGACAAAGC TTCGGGCGGT GGCGGTTCTG GTGGCGGTGG CTCCGGCGGT 900 
GGCGGTTCTC TAGAGGGACA AAGGAAAAGA AGAAATACAA TTCATGAATT CAAAAAATCA 960 
GCAAAGACTA CCCTAATCAA AATAGATCCA GCACTGAAGA TAAAAACCAA AAAAGTGAAT 1020 
ACTGCAGACC AATGTGCTAA TAGATGTACT AGGAATAAAG GACTTCCATT CACTTGCAAG 1080 
GCTTTTGTTT TTGATAAAGC AAGAAAACAA TGCCTCTGGT TCCCCTTCAA TAGCATGTCA 1140 
AGTGGAGTGA AAAAAGAATT TGGCCATGAA TTTGACCTCT ATGAAAACAA AGACTACATT 1200 
AGAAACTGCA TCATTGGTAA AGGACGCAGC TACAAGGGAA CAGTATCTAT CACTAAGAGT 1260 
GGCATCAAAT GTCAGCCCTG GAGTTCCATG ATACCACACG AACACAGCTA TCGGGGTAAA 1320 
GACCTACAGG AAAACTACTG TCGAAATCCT CGAGGGGAAG AAGGGGGACC CTGGTGTTTC 1380 
ACAAGCAATC CAGAGGTACG CTACGAAGTC TGTGACATTC CTCAGTGTTC AGAAGTTGAA 1440 
TGCATGACCT GCAATGGGGA GAGTTATCGA GGTCTCATGG ATCATACAGA ATCAGGCAAG 1500 
ATTTGTCAGC GCTGGGATCA TCAGACACCA CACCGGCACA AATTCTTGCC TGAAAGATAT 1560 
CCCGACAAGG GCTTTGATGA TAATTATTGC CGCAATCCCG ATGGCCAGCC GAGGCCATGG 1620 
TGCTATACTC TTGACCCTCA CACCCGCTGG GAGTACTGTG GAATTAAAAC ATGCGCTGAC 1680 
AAAGCTGACG ACGACGACAA ACACCACCAC CACCACCACC ACTAG 1725 
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(2) INFORMATION FOR SEQ ID NO: "2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 574 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO > 

> - 

(iv) ANTI-SENSE: NO " ' " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Tip Val Thr Lys Leu Leu Pro Ala Leu Leu Leu Gin His Val Leu 
15 10 15 

Leu His Leu Leu Leu Leu Pro He Ala He Pro Tyr Ala Glu Gly Gin 
20 25 30 

Arg Lys Arg Arg Asn Thr He His Glu Phe Lys Lys Ser Ala Lys Thr 
35 40 45 

Thr Leu He Lys He Asp Pro Ala Leu Lys He Lys Thr Lys Lys Val 
50 55 60 

Asn Thr Ala Asp Gin Cys Ala Asn Arg Cys Thr Arg Asn Lys Gly Leu 
65 70 75 80 

Pro Phe Thr Cys Lys Ala Phe Val Phe Asp Lys Ala Arg Lys Gin Cys 
85 90 95 

Leu Trp Phe Pro Phe Asn Ser Met Ser Ser Gly Val Lys Lys Glu Phe 
100 105 110 

Gly His Glu Phe Asp Leu Tyr Glu Asn Lys Asp Tyr He Arg Asn Cys 
115 120 125 

He He Gly Lys Gly Arg Ser Tyr Lys Gly Thr Val Ser He Thr Lys 
130 135 140 

Ser Gly He Lys Cys Gin Pro Trp Ser Ser Met He Pro His Glu His 
145 150 155 160 

Ser Tyr Arg Gly Lys Asp Leu Gin Glu Asn Tyr Cys Arg Asn Pro Arg 
165 170 175 

Gly Glu Glu Gly Gly Pro Trr> Cys Phe Thr Ser Asn Pro Glu Val Arg 
180 185 190 

Tyr Glu Val Cys Asp He Pro Gin Cys Ser Glu Val Glu Cys Met Thr 
195 200 205 

Cys Asn Gly Glu Ser Tyr Arg Gly Leu Met Asp His Thr Glu Ser Gly 
210 215 220 
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Lys He Cys Gin Arg Trp Asp His Gin Thr Pro His Arg His Lys Plie 
225 230 235 240 

Leu Pro Glu Arg Tyr Pro Asp Lys Gly Phe Asp Asp Asn Tyr Cys Arg 
245 250 255 

Asn Pro Asp Gly Gin Pro Arg Pro Trp Cys Tyr Thr Leu Asp Pro His 
260 265 270 

Thr Arg Trp Glu Tyr Cys Ala He Lys Thr Cys Ala Asp Lys Ala Ser 
275 280 ' 285 

Gly Gly Gly Gly Ser Gly Gly Gly Gly Sef Gly Gly Gly Gly Ser Leu 
290 295 300 

Glu Gly Gin Arg Lys Arg Arg Asn Thr He His Glu Phe Lys Lys Ser 
305 310 315 320 

Ala Lys Thr Thr Leu He Lys He Asp Pro Ala Leu Lys He Lys Thr 
325 330 335 

Lys Lys Val Asn Thr Ala Asp Gin Cys Ala Asn Arg Cys Thr Arg Asn 
340 345 350 

Lys Gly Leu Pro Phe Thr Cys Lys Ala Phe Val Phe Asp Lys Ala Arg 
355 360 365 

Lys Gin Cys Leu Trp Phe Pro Phe Asn Ser Met Ser Ser Gly Val Lys 
370 375 380 

Lys Glu Phe Gly His Glu Phe Asp Leu Tyr Glu Asn Lys Asp Tyr He 
385 390 395 400 

Arg Asn Cys He He Gly Lys Gly Arg Ser Tyr Lys Gly Thr Val Ser 
405 410 415 

He Thr Lys Ser Gly He Lys Cys Gin Pro Trp Ser Ser Met He Pro 
420 425 430 

His Glu His Ser Tyr Arg Gly Lys Asp Leu Gin Glu Asn Tyr Cys Arg 
435 440 445 

Asn Pro Arg Gly Glu Glu Gly Gly Pro Trp Cys Phe Thr Ser Asn Pro 
450 455 460 

Glu Val Arg Tyr Glu Val Cys Asp He Pro Gin Cys Ser Glu Val Glu 
465 470 475 480 

Cys Met Thr Cys Asn Gly Glu Ser Tyr Arg Gly Leu Met Asp His Thr 
485 490 495 



Glu Ser Gly Lys He Cys Gin Arg Trp Asp His Gin Thr Pro His Arg 
500 505 510 



His Lys Phe Leu Pro Glu Arg Tyr Pro Asp Lys Gly Phe Asp Asp Asn 
515 520 525 
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Tyr Cvs Arg Asn Pro Asp Gly Gin Pro Arg Pro Trp Cys Tyr Thr Leu 
530 535 540 

Asp Pro His Thr Arg Trp Glu Tyr Cys Ala He Lys Thr Cys Ala Asp 
545 550 555 560 

Lys Ala Asp Asp Asp Asp Lys His His His His His His His 
565 570 

(2) INFORMATION FOR SEQ ID NO: 3: 

> 

(i) SEQUENCE CHARACTERISTICS: ' ' 

(A) LENGTH: 1692 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATGGGGTGGC TCCCACTCCT GCTGCTTCTG ACTCAATGCT TAGGGGTCCC TGGGCAGCGC 60 

TCGCCATTGA ATGACTTCCA AGTGCTCCGG GGCACAGAGC TACAGCACCT GCTACATGCG 120 

GTGGTGCCCG GGCCTTGGCA GGAGGATGTG GCAGATGCTG AAGAGTGTGC TGGTCGCTGT 180 

GGGCCCTTAA TGGACTGCCG GGCCTTCCAC TACAACGTGA GCAGCCATGG TTGCCAACTG 240 

CTGCCATGGA CTCAACACTC GCCCCACACG AGGCTGCGGC GTTCTGGGCG CTGTGACCTC 300 

TTCCAGAAGA AAGACTACGT ACGGACCTGC ATCATGAACA ATGGGGTTGG GTACCGGGGC 360 

ACCATGGCCA CGACCGTGGG TGGCCTGCCC TGCCAGGCTT GGAGCCACAA GTTCCCGAAT 420 

GATCACAAGT ACACGCCCAC TCTCCGGAAT GGCCTGGAAG AGAACTTCTG CCGTAACCCT 480 

GATGGCGACC CCGGAGGTCC TTGGTGCTAC ACAACAGACC CTGCTGTGCG CTTCCAGAGC 540 

TGCGGCATCA AATCCTGCCG GGAGGCCGCG TGTGTCTGGT GCAATGGCGA GGAATACCGC 600 

GGCGCGGTAG ACCGCACGGA GTCAGGGCGC GAGTGCCAGC GCTGGGATCT TCAGCACCCG 660 

CACCAGCACC CCTTCGAGCC GGGCAAGTTC CTCGACCAAG GTCTGGACGA CAACTATTGC 720 

CGGAATCCTG ACGGCTCCGA GCGGCCATGG TGCTACACTA CGGATCCGCA GATCGAGCGA 780 

GAGTTCTGTG ACCTCCCCCG CTGCGGGTCC GAGGCACAGC CCCGCCTCGA GGGCGGTGGC 840 

GGTTCTGGTG GCGGTGGCTC CGGCGGTGGC GGTTCTCTAG AGGGACAAAG GAAAAGAAGA 900 

AATACAATTC ATGAATTCAA AAAATCAGCA AAGACTACCC TAATCAAAAT AGATCCAGCA 960 
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CTGAAGATAA AAACCAAAAA AGTGAATACT GCAGACCAAT GTGCTMTAG ATGTACTAGG 1020 
AATAAAGGAC TTCCATTCAC TTGCAAGGCT TTTGTTTTTG ATAAAGCAAG AAAACAATGC 1080 
CTCTGGTTCC CCTTCAATAG CATGTCAAGT GGAGTGAAAA AAGAATTTGG CCATGAATTT 1140 
GACCTCTATG AAAACAAAGA CTACATTAGA AACTGGATCA TTGGTAAAGG ACGCAGCTAC 1200 
AAGGGAACAG TATCTATCAC TAAGAGTGGC ATCAAATGTC AGCCCTGGAG TTCCATGATA 1260 
CCACACGMC ACAGCTATCG GGGTAAAGAC CTACAGGAAA ACTACTGTCG AAATCCTCGA 1320 
GGGGAAGAAG GGGGACCCTG GTGTTTCACA AGCAATCCAG- AGGTACGCTA CGAAGTCTGT 1380 
GACATTCCTC AGTGTTCAGA AGTTGAATGC ATGACCTGCA ATGGGGAGAG TTATCGAGGT 1440 
CTCATGGATC ATACAGAATC AGGCAAGATT TGTCAGCGCT GGGATCATCA GACACCACAC 1500 
CGGCACAAAT TCTTGCCTGA AAGATATCCC GACAAGGGCT TTGATGATAA TTATTGCCGC 1560 
AATCCCGATG GCCAGCCGAG GCCATGGTGC TATACTCTTG ACCCTCACAC CCGCTGGGAG 1620 
TACTGTGCAA TTAAAACATG CGCTGACAAA GCTGACGACG ACGACAAACA CCACCACCAC 1680 
CACCACCACT AG 1692 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 563 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Gly Trp Leu Pro Leu Leu Leu Leu Leu Thr Gin Cys Leu Gly Val 
15 10 15 

Pro Gly Gin Arg Ser Pro Leu Asn Asp Phe Gin Val Leu Arg Gly Thr 
20 25 30 

Glu Leu Gin His Leu Leu His Ala Val Val Pro Gly Pro Trp Gin Giu 
35 40 45 

Asp Val Ala Asp Ala Glu Glu Cys Ala Gly Arg Cys Gly Pro Leu Met 
50 55 60 

Asp Cys Arg Ala Phe His Tyr Asn Val Ser Ser His Gly Cys Gin Leu 
65 70 75 80 
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Leu Pro Trp Thr Gin His Ser Pro His Thr Arg Leu Arg Arg Ser Gly 
85 90 95 

Arg Cys Asp Leu Phe Gin Lys Lys Asp Tyr Val Arg Thr Cys He Met 
100 105 110 

Asn Asn Gly Val Gly Tyr Arg Gly Thr Met Ala Thr Thr Val Gly Gly 
115 120 125 

Leu Pro Cys Gin Ala Trp Ser His Lys Phe Pro Asn Asp His Lys Tyr 
130 135 ' 140 

" Thr Pro Thr Leu Arg Asn Gly Leu Glu Glu Asn Phe Cys Arg Asn Pro 
145 150 155 160 

Asp Gly Asp Pro Gly Gly Pro Trp Cys Tyr Thr Thr Asp Pro Ala Val 
165 170 175 

Arg Phe Gin Ser Cys Gly He Lys Ser Cys Arg Glu Ala Ala Cys Val 
180 185 190 

Trp Cys Asn Gly Glu Glu Tyr Arg Gly Ala Val Asp Arg Thr Glu Ser 
195 200 205 

Gly Arg Glu Cys Gin Arg Trp Asp Leu Gin His Pro His Gin His Pro 
210 215 220 

Phe Glu Pro Gly Lys Phe Leu Asp Gin Gly Leu Asp Asp Asn Tyr Cys 
225 230 235 240 

Arg Asn Pro Asp Gly Ser Glu Arg Pro Trp Cys Tyr Thr Thr Asp Pro 
245 250 255 

Gin He Glu Arg Glu Phe Cys Asp Leu Pro Arg Cys Gly Ser Glu Ala 
260 265 270 

Gin Pro Arg Leu Glu Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 
275 280 285 

Gly Gly Gly Ser Leu Glu Gly Gin Arg Lys Arg Arg Asn Thr He His 
290 295 300 

Glu Phe Lys Lys Ser Ala Lys Thr Thr Leu He Lys He Asp Pro Ala 
305 310 315 320 

Leu Lys He Lys Thr Lys Lys Val Asn Thr Ala Asp Gin Cys Ala Asn 
325 330 335 

Arg Cys Thr Arg Asn Lys Gly Leu Pro Phe Thr Cys Lys Ala Phe Val 
340 345 350 

Phe Asp Lys Ala Arg Lys Gin Cys Leu Trp Phe Pro Phe Asn Ser Met 
355 360 365 



Ser Ser Gly Val Lys Lys Glu Phe Gly His Glu Phe Asp Leu Tyr Glu 
370 375 380 
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Asn Lys Asp Tyr lie Arg Asn Cys He He Gly Lys Gly Arg Ser Tyr 
385 390 395 400 

Lys Gly Thr Val Ser He Thr Lys Ser Gly He Lys Cys Gin Pro Trp 
405 410 415 

Ser Ser Met He Pro His Glu His Ser Tyr Arg Gly Lys Asp Leu Gin 
420 425 430 

Glu Asn Tyr Cys Arg Asn Pro Arg Gly Giu,Glu Gly Gly Pro Trp Cys 
435 440 - 445 

Phe Thr Ser Asn Pro Glu Val Arg Tyr Glu Val Cys Asp He Pro Gin 
450 455 460 

Cys Ser Glu Val Glu Cys Met Thr Cys Asn Gly Glu Ser Tyr Arg Gly 
465 470 475 480 

Leu Met Asp His Thr Glu Ser Gly Lys He Cys Gin Arg Trp Asp His 
485 490 495 

Gin Thr Pro His Arg His Lys Phe Leu Pro Glu Arg Tyr Pro Asp Lys 
500 505 510 

Gly Phe Asp Asp Asn Tyr Cys Arg Asn Pro Asp Gly Gin Pro Arg Pro 
515 520 525 

Trp Cys Tyr Thr Leu Asp Pro His Thr Arg Trp Glu Tyr Cys Ala He 
530 535 540 

Lys Thr Cys Ala Asp Lys Ala Asp Asp Asp Asp Lys His His His His 
545 550 555 560 

His His His 
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ATGTGGGTGACCAAACTCCTGCCAGCCCTGC7GCTGCAGCATGTCCTCCTGCATCTCCTC 



P 

o 



7ACACCCACTGG77TGAGGACGGTCGGGACGA.CGACG7CGTACAGGAGGACGTAGAGGAG 



60 



1 M W V T K L L P A L L L -> Q H V L L H L L 20 

C7GCTCCCCATCGCCA7CCCC7A7GCAGAGGGACAAAGGAJLAAGAAGAAA7ACAA77CA7 
GACGAGGGGTAGCGGTAGGGGATACGTCTCCCTGTTTCCTTTTCTTCTTTATGTTAAGTA 
21 LL P I A I PYAEGQRKRRN T I H 40 

o 

l?i GAATTCAJWIAATCAGCAAAGACTACCCTAATCAAAATAGATCCAGCACTGAAGATAJLAA ^ ^ 

CP *" CTTAAGTTTTTTAGTCGTTTCTGATGGGATTAGTTTTATCTAGGTCGTGACTTCTATTTT 



41 E FKKSAKTTLI K I D PAL K I K 60 



a 
a 
ji 

, f l ACCAAAJW^GTGAATACTGCAGACCAJiTGTGCTAATAGATGTACTAGGAATAAAGGACTT 

H TGGTTTTTTCACTTATGACGTCTGGTTACACGATTATCTACATGATCCTTATTTCCTGAA 
Jl, 617KKVN7ADQCANRC7RNKGL 80 

51- 

u CCATTCACTTGCAAGGCTTTTGTTTTTGATAAAGCAAGAAAACAATGCCTCTGGTTCCCC 

If] GGTAAGTGAACGTTCCGAAAACAAAAACTATTTCGTTCTTTTGTTACGGAGACCAAGGGG 



81 PFTCKAFVFDKARKQCLWF? 100 

TTCAATAGCATGTCAAGTGGAGTGAAAAAAGAATTTGGCCATGAATTTGACCTCTATGAA 
AAGTTATCGTACA.GTTCACCTGACTTTTTTCTTAAACCGGTA.CTTAAACTGGAGA.TACTT 
101 FNSMSSGVKKEFGHEFDLYE 120 

AACAAAGACTACATTAGAAACTGCATCATTGGTAAAGGACGCAGCT ACAAGGGAACAGT A ^ 
TTGTTTCTGATGTAATCTTTGACGTAGTAACCATTTCCTGCGTCGATGTTCCCT7GTCAT 
121 NKDYIRNCIIGKGRSYKGTV 140 



TC7A7CAC7AAGAG7GGCATCAAA7G7CAGCCC7GGAG77CCA7GATACCACACGAACAC 

421 f + ~ -— t 

AGA7AG7GA.77C7GACCG7AG777ACAG7CGGGACC7CAAGG7A.G7A7GG7G7GC77G7G 

141 S I T K S G I K C C P W S S M I ? H E r- 
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AGCTATCGGGGTAAAGACCTACAGGAAnACTACTGTCGAAATCCTCGAGGGGAAGAAGGG 

481 -r -r -f- 4- -r 540 

TCGATAGCCCCATTTCTGGATGTCCTTTTGATGACAGCTTTAGGAGCTCCCCTTCTTCCC 
161 SYRGKDLQENYCRN? R G E E G 180 

_ ^ GGACCCTGGTGTTTCACAAGCAATCCAGAGGTACGCTACGAAGTCTGTGACATTCCTCAG 
CC7GGGACCACAAAGTGTTCGTTAGGTCTCCATGCGATGCTTCAGACACTGTAAGGAGTC 

181 GPVJCFTSNPEVRYEVCDI ? Q 200 

^ ^ T G T T C AG AAG T T G AAT G CAT G A C C T G C AAT G G G GAG AG T TAT C G AG G T C T C AT G G A.T C A.T 
ACAAGTCTTCAACTTA.CGTACTGGACGTTA.CCCCTCTCAATAGCTCCAGAGTACCTAGTA 
201CSEVECMTCNGESYRGLMDK 220 

ACAGAATCAGGCAAGA.TTTGTCA.GCGCTGGGA.TCA.TCAGACACCA.CACCGGCA.CAAA.TTC 

H TGTCTTAGTCCGTTCTAAACAGTCGCGACCCTAGTAGTCTGTGGTGTGGCCGTGTTTAAG 

€1 

ril 221 TSSGKICQRWDHQTPHRKKF 240 

Si 

C! TTGCCTGAAAGATATCCCGACAAGGGCTTTGATGATAATTATTGCCGCAATCCCGATGGC 
AACGGACTTTCTA.TAGGGCTGTTCCCGAAACTACTATTAATAACGGCGTTAGGGCTACCG 

241 LPERYPDKGFDDNYCRNPDG 260 

= 

Q CAGCCGAGGCCATGGTGCTATACTCTTGACCCTCACACCCGCTGGGAGTACTGTGCAATT 

^ J GTCGGCTCCGGTACCACGATATGAGAACTGGGAGTGTGGGCGACCCTCATGACACGTTAA 

H| 261 QPR PWCYT LDPKTRWEYCAI 280 

5 

p AAAACATGCGCTGACAATACTATGAATGACACTGATGTTCCTTTGGAAACAACTGAATGC 
TTTTGTACGCGACTGTTATGATACTTACTGTGACTACAAGGAAACCTTTGTTGACTTACG 
231 KTC ADNTMN DTDV? LETTEC 300 

ATCCAAGGTCAAGGAGAAGGCTACAGGGGCACTGTCAATA.CCATTTGGAATGGAATTCCA 
TAGGTTCCAGTTCCTCTTCCGATGTCCCCGTGA.CAGTTATGGTAAACCTTACCTTAAGGT 
301 IQGQGEGYRGTVNTIWNGIP 320 

TGTCAGCGTTGGGATTCTCAGTATCCTCACGAGCATGACATGACTCCTGAJUVATTTCAAG 

961 + -r + +- -r -i- 1020 

A.CAGTCGCAACCCTAAGAGTCATA.GGAGTGCTCGTACTGTA.CTGAGGACTTTTAAAGTTC 

321 C Q R W D S Q Y ? H E K D M T P E N F K 34 0 

TGCAAGGACCTACGAGAAAA.TTACTGCCGA_AATCCAGATGGGTCTGAATCACCCTGGTGT 

1021 -r -r 1080 

ACGTTCCTGGATGCTCTTTTAA.TGA.CGGC7TTA.GG7CTACCCAGACTTAGTGGGACCACA 

34 1 C K D L R E M Y C R N ? D G S E S P W C 3 60 

T7TAjCCACTGA.TCCAAACATCCGAGTTGGCTACTGCTCCCAAATTCCAAACTGTGATA.TG 

1Q81 -r -r 1140 

.A.AA.TGG7GACTAGGTTTGTA.GGCTCAACCGATGACGAGGGTTTAA.GGTTTGACACTATAC 

351 F 7 T D?N I RVGYCSQI ? N C D M 3 30 

( fin » i , -, /-J \ 
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TCACATGGACAAGATTGTTATCGTGGGAATGGCAA-iJlATTATATGGGCAACTTATCCCAA 17QQ 
1141 AGTGTACCTGTTCTAACAATAGCACCCTTACCGTTTTTAATATACCCGTTGAATAGGGTT 
381SKGQDCYRGNGKNYMGNLSQ 4 00 

ACAAGATCTGGACTAACATGTTCAATGTGGGACAAGAACA.TGGAAGACTTACATCGTCAT ^ 
TGTTCTAGACCTGATTGTACAAGTTACACCCTGTTCTTGTACCTTCTGAATGTAGCAGTA 
401TRSGLTCSMWDKNME D L H R H 4 20 

^TCTTCTGGGAACCAGATGCAAGTAAGCTGAATGAGAATTACTGCCGAAATCCAGATGAT 

1261 + +- ~ — + 1320 

TAGAAGACCCTTGGTCTACGTTCATTCGACTTACTCTTAATGACGGCTTTAGGTCTACTA 

421 I F W E P D AS KL N E NY C R N P D D 440 

GACGCTCA.TGGACCCTGGTGCTACACGGGAAATCCACTCATTCCTTGGGATTATTGCCCT 
O ' " CTGCGAGTACCTGGGACCACGATGTGCCCTTTAGGTGAGTAAGGAACCCTAATAACGGGA 

f|i 441 DAKGPwCYTGNPLIPWDYC ? 4 60 

p ATTTCTCGTTGTGAAGGTGATACCACACCTACAATAGTCAATTTAGACCATCCCGTAATA 1440 

y ;f ~^ ' TAAAGAGCAACACTTCCACTATGGTGTGGATGTTATCAGTTAAATCTGGTAGGGCATTAT 

a? 

Mi 4 61 I S R C E G D T T P T I V N L D H P V I 480 

CI TCTTGTGCCAJIAACGAAACAATTGCGA.GTTGTAAATGGGA.TTCCAACACGAACAAACATA 

;:; 1441 + + + + + + 1500 

- AGAACACGGTTTTGCTTTGTTAACGCTCAACATTTACCCTAA.GGTTGTGCTTGTTTGTA.T 

yi 481 SCAKTKQLRVVNGI PTRTNI 500 

CI 

p GGATGGATGGTTAGTTTGAGATACAGAAATAAACATATCTGCGGAGGATCATTGATAAAG 
CCTACCTA.CCAATCAAACTCTATGTCTTTATTTGTATAGACGCCTCCTAGTAACTATTTC 
501 G W M v S L R Y R N K K I C G G S L I K 5 20 

GAGAGTTGGGTTCTTACTGCACGACAGTGTTTCCCTTCTCGAGA.CTTGAAAGATTATGAA ^ 
CTCTCAACCCAAGAATGACGTGCTGTCACAAAGGGAAGAGCTCTGAACTTTCTAATACTT 
521 ESWVLTARQCFPSRDLKDYE 540 

GCTTGGCTTGGAATTCATGATGTCCACGGAAGAGGAGATGAGAAATGCAAACAGGTTCTC ^ ^ ^ 
i ° Z ~ CGAACCGAACCTTAAGTACTACAGGTGCCTTCTCCTCTACTCTTTACGTTTGTCCAAGAG 
54 1 A vv L G I H D V K G R G D E K C K Q V L 5 60 



.63 



1740 



TTACAPA.GGGTCGACCATATACCGGGACTTCCTAGTCTAGACCAAAATTACTTCGAACGG 
5 61 M V S Q L V Y G ? E G S D L V L M K L A 53 0 



1300 



AGGCCTGCTGTCCTGGATGATTTTGTTAGTACGATTGATTTACCTAATTATGGATGCACA 

"! 7 ^ ]_ - i ! i 

TCCGGACGACAGGACCTACTAAAACAATCATGCTAACTAAATGGATTAATACCTACGTGT 

53 1 R ? A V 1 D D F V S T 1 D L P M Y G C T 60 0 
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(continued) 

ATTCCTGAAAAGACCAGTTGCAG7GTTTATGGCTGGGGCTACACTGGATTGATCAACTAT 

g q ■[ | • | ! 1 f i b bU 

TAAGGACTTTTCTGGTCAACGTCACAAATACCGACCCCGATGTGACCTAACTAGTTGATA 

601 IPEKTSCSVYGWGYTG LINY 620 

GATGGCCTATTACGAGTGGCACATCTCTAT ATAATGGGA^ q 
i8Sl CTACCGGATAATGCTCACCGTGTAGAGATAT 

621 D " G L L R V A H L Y I M G N E K C S Q K 640 

CATCGAGGGAAGGTGACTCTGAATGAGTCTGAAATATGTGCTGGGGCTGAAAAGATTGGA 

TLQ-)-< i • ! ' ~ iyBU 

GTAGCTCCCTTCCACTGAGACTTACTCAGACTTTATACACGACCCCGACT T TTCTAACC T 
541 h R G K V T L N E S E I C A G A E K I G 660 

TCAGGACCATGTGAGGGGGATTATGGTGGCCCACTTGTT^ 2Q4Q 
X 98 1 AGTCCTGGTACACTCCCCCTAATAC^ 



661 S G 



PCEGDYGGPLVCEQKKMR 680 



ATGGTTCTTGGTGTCATTGTTCCTGGTCGTGGATG x GCCATTCCAAATCGTCCTGGTATT ^ ^ 

2041 TACCAAGAACCACAGTAACAAGGACCAGCACCTACACGGTAAG^ 

681MVLGVIVPGRGCAIPNRPGI 700 

TTTGTCCGAGTAGCATATTATGCAAAATGGATACACAAAATTATTTTAACAT q 
2101 AA^CAGGCTCATCGTATAATACGTTTT^ 



701 E V R V A Y 



YAKWI H K I ILTYKV 720 



CCACAGTCATAG 

2161 ^= 2172 

GGTGTCAGTATC 

721 P Q S * 723 
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ATGGGGTGGCTCCCACTCCTGC7GCTTCTGACTCAATGCTTAGGGGTCCCTGGGCA.GCGC 



60 



TACCCCACCGAGGGTGAGGACGACGAAGACTGAGTTACGAATCCCCAGGGACCCGTCGCG 
1MGWLPLLLLLTQCLGVPGQ?. 20 



TCGCCATTGAATGACTTCCAAGTGCTCCGGGGCACSGAGCT.ACAGCACCTGCTACATGCG 
1 AGCGGTAACTTACTGAAGGTTC 

ISPLNDFQVLRGTELQHLLHA 



120 
40 



GTGGTGCCCGGGCCTTGGCAGC-AGGATGTGGCAGATGCTGAA.GAGTGTGCTGGTCGCTGT 

j r iOU 



1 4- + + ~r 

" CACCACGGGCCCGGAACCGTCCTCCTACACCGTCTACGACTTCTCACACGACCAGCG^CA 
1 V V ? G ? W Q E D V A D A E E C A G 3. C 



60 



240 



GGGCCCTTAATGGACTGCCGGGCCTTCCACTACAACGTGAGCAGCCA 
1 CCCGGGAATTACCTGACGGCCCGGAAGGTGATGTTGCACTCGTCGGTACCAACGGTTGAC 
lGPLMDCRAFHYNVSSaGCQL 80 



CTGCCATGGACTCAACACTCGCCCCACACGAGGCTGCGGCGT^ 
1 GACGGTACCTGAGTTGTGAGCGGGGTGTGCTCCGACGCCGCAAGACCCG 
1LPWTQHSPKTRLRRSGRCDL 

TTCCAGAAGAAA.GACTACGTACGGA.CCTGCAT CATGAJ\CAA.TGGGGTTGGGTACCGGGGC 
1 AAGGTCT TCTTT CTGATGCATGCCTGGACGT AGTACTTGTT ACCC CAACCC AT GGCCCCG 
1 e-QKKDYVRTCIMNNGVGYRG 

ACCATGGCCACGACCGTGGGTGGCCTGCCCTGCCAGGCTTGGAGCCACAA^ 
1 TGGTACCGGTGCTGGCACCCACCGGACGGGACGGTCCGAACCTCGGTGTTCAAGGGCTTA 
1TMATTVGGLPCQAWSHKFPN 



300 
100 

360 
120 

420 
14 0 



480 



GA-CACAAGTACACGCCCACTCTCCGGAATGGCCTGGAAGAGAACTTCTGCCGTAACCCT 

1 -r + -r + + ZZ 

CTAGTGTTCATGTGCGGGTGAGAGGCCTTACCGGACCTTCTCTTGAAGACGGCATT^V--oA 

! n H K Y T P T L R N G L E E N F C R N r ? ^ 60 
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GATGGCGACCCCGGAGG7CCTTGGTGCTACACAACAGACCCTGCTGTGCGCTTCCAGAGC 
CTACCGCTGGGGCCTCCAGGAACCACGATGTGT7GTCTGGGACGACACGCGAAGGTCTCG 



540 

161 DGDPGGPWCYTTDPAVRFQS 180 



^ TGCGGCATCAAATCCTGCCGGGAGGCCGCGTGTGTCTGGTGCAATGGCGAGGAATACCGC 

ACGCCGTAGTTTA.GGACGGCCCTCCGGCGCACACAGACCACGTTACCGCTCCTTATGGCG ^ 
181 CG IKSCREAACVWCNGEEYR 200 

GGCGCGGTAGACCGCACGGAGTCAGGGCGCGAGTGdCAGCGCTGGGA.TCTTCAGCACCCG 
CCGCGCCATCTGGCGTGCCTCAGTCCCGCGCTCACGGTCGCGACCCTAGAAGTCGTGGGC & °° 
201 GAVDRTESGRECQRWDLQK? 220 

^ CACCAGCACCCCTTCGAGCCGGGCAAGTTCCTCGACCAAGGTCTGGACGACAACTATTGC 
CI GTGGTCGTGGGGAAGCTCGGCCCGTTCAAGGA.GCTGGTTCCAGACCTGCTGTTGATAACG 

m 
m 
a 

r'i CGGAATCCTGACGGCTCCGAGCGGCCATGGTGCTACACTACGGATCCGCAGATCGAGCGA 

=!? 721 + 4- 4- -f + -r 780 

W GCCTTAGGACTGCCGAGGCTCGCCGGTACCACGATGTGATGCCTAGGCGTCTAGCTCGCT 



221 HQHPFEPGKFLDQGLDDNYC 2 40 



241RNPDGSERPWCYTTDPQIER 2 50 



GAGTTCTGTGACCTCCCCCGCTGCGGGTCCGAGGCACAGCCCCGCCAAGAGGCCA.CAACT 

781 + 4- 4- 4- 4- 4- 8 40 

-U CTCAAGACACTGGAGGGGGCGACGCCCAGGCTCCGTGTCGGGGCGGTTCTCCGGTGTTGA 

\ n 261 EFCDLPRCGSEAQPRQEATT 280 

C| 

GTCAGCTGCTTCCGCGGGAAGGGTGAGGGCTACCGGGGCACAGCCAATACCACCACTGCG 

^ 841 r 4 -i i 900 

CAGTCGACGAAGGCGCCCTTCCCACTCCCGATGGCCCCGTGTCGGTTATGGTGGTGACGC 
28IVSCFRGKGEGYRGTANTTTA 300 

GGCGTACCTTGCCAGCGTTGGGACGCGCAAATCCCGCATCAGCACCGATTTACGCCAGAA 
CCGCATGGAACGGTCGCAACCCTGCGCGTTTAGGGCGTAGTCGTGGCTAAATGCGGTCTT 
301 GVPCQRWDAQI PHQKRFTPE 320 

^ ^ AAATACGCGTGCAAAGACCTTCGGGAGAACTTCTGCCGGAACCCCGACGGCTCAGAGGCG 
TTTATGCGCACGTTTCTGGAAGCCCTCTTGAAGACGGCCTTGGGGCTGCCGAGTCTCCGC 
321 KYACKDLREN FC RN ? DGSEA 340 

CCCTGGTGCTTCACAC7GCGGCCCGGCATGCGCGCGGCCTTTTGCTA.ee AG ATCCGGCGT 

1021 4- - _ 1080 

GGGACCACGAAGTGTGACGCCGGGCCGTACGCGCGCCGGAAAACGATGGTCTAGGCCGCA 

34 1 ? W C FT L R ? G M R A A F C Y Q I R R 3 60 

TGTACAGACGACGTGCGG CCCC AGGACTGCTACCA.CGGCGCA.GGGGAGCAGTACCGCGGC 
ACATGTCTGCTGCACGCCGGGGTCCTGACGATGGTGCCGCGTCCCCTCGTCATGGCGCCG 

361 C T D D V R ? Q D C Y K G AG E Q Y RG 380 
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ACGGTCAGCAAGACCCGCAAGGGTGTCCAGTGCCA.GCGCTGGTCCGCTGAGACGCCGCAC 
TGCCAGTCG7TCTGGGCGTTCCCA.CAGGTCACGGTCGCGACCAGGCGACTC7GCGGCGTG 
3 3 1 T V S K T R K G V Q C Q R W S A E T P H 400 

^ AAGCCGCAGTTCACGTTTACCTCCGAACGGCATGCACAACTGGAGGAGAACTTCTGCCGG 
TTCGGCGTCAAGTGCAAATGGAGGCTTGGCGTACGTGTTGACCTCCTCTTGAAGACGGCC 
401KPQFTFTSEPHAQLEENFCR 4 20 

_ ^ AACCCAGATGGGGATAGCCATGGGCCCTGGTGCTACACGATGGACCCAAGGACCCCATTC 
TTGGGTCTACCCCTATCGGTACCCGGGACCACGA.TGTGCTACCTGGGTTCCTGGGGTA.AG 
421 N ? DG DS K G P W C Y T M D P RT ? F 440 

^ GACTA.CTGTGCCCTGCGA.CGCTGCGCTGA.TGACCAGCCGCCATCAATCCTGGACCCCCCA 
O CTGA.TGACACGGGACGCTGCGACGCGACTACTGGTCGGCGGTAGTTA.GGACCTGGGGGGT 



II! 
Ci 
0 



441 D Y C A. L R R C A. DDQPPSILD?? 4 60 



€1 

m 

LI 

X ^ GACCAGGTGCAGTTTGAGAAGTGTGGCAAGAGGGTGGATCGGCTGGATCAGCGGCGTTCC 

yjl CTGGTCCACGTCAAACTCTTCACACCGTTCTCCCACCTAGCCGACCTAGTCGCCGCAAGG 

=.?i 

£ r 4 61DQVQFEKCGKRVDRLDQRRS 480 
AAGCTGCGCGTGGTTGGGGGCCATCCGGGCAACTCA.CCCTGGACAGTCAGCTTGCGGAAT 

fef 14 41 4- -i -r -r h ~ 1500 

sll TTCGACGCGCACCAACCCCCGGTAGGCCCGTTGAGTGGGACCTGTCAGTCGAACGCCTTA 



481 KLRVVGGK ?GNS PWTVSLRN 500 

CGGCAGGGCCA.GCATTTCTGCGGGGGGTCTCTAGTGAAGGA.GCAGTGGATACTGACTGCC 

1501 -+- + -r 15 60 

GCCGTCCCGGTCGTAAAGACGCCCCCCAGAGA.TCA.CTTCCTCGTCACCTA.TGACTGACGG 

501 R Q G Q K F C G G S L V KE Q W I L T A 520 

CGGCAGTGCTTCTCCTCCTGCCA.TATGCCTCTCACGGGCTATGAGGTATGGTTGGGCA.ee 
GCCGTCA.CGAAGAGGAGGACGGTATACGGAGAGTGCCCGATACTCCATACCAACCCGTGG 
521RQCFSSCHMPLTGYEVWLGT 540 

^ CTGTTCCAGAACCCACAGCATGGAGAGCCAAGCCTACAGCGGGTCCCAGTAGCCAAGA-TG 
GACAAGGTCTTGGGTGTCGTACCTCTCGGTTCGGATGTCGCCCAGGGTCA.TCGGTTCTAC 
54 1 L F Q N ? Q H G E ? S L Q R V ? V A K M 5 SO 



GiG; >oTGGGCCC _ CAGGCTCCC.AGCTTGTCCTGCTC--_AGCTGGA.GAG 

1681 -r -r 1740 

CACAC.ACCCGGGAGTCCG.AGGGTCGAACAGG.ACGAGTTCGACCTCTCTAG.ACACTGGGAC 
5 51 V C G ? S G S Q L V L L K 1 E R S V T L 56 0 

AACCA.GCGTGTGGCCeTGATCTGCCTGCCCCCTG.AA.TGGTA.TGTGGTGCCTCCA.GGGA.ee 

1741 -r -f- -r 1300 

TTGGTCGCACACCGGGACTAGA.CGGACGGGGGA.CT7ACCATACACCACGGAGGTCCCTGG 

531 M Q R V A L I C L ? ? E W Y V V ? ? G I 50 0 
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(continued) 

AAGTGTGAGATTGCAGGCTGGGGTGAGACCAAAGGTACGGGTAATGACACAGTCCTAAAT 

1801 + + + + + + I860 

TTCACACTCTAACGTCCGACCCCACTCTGGTTTCCATGCCCATTACTGTGTCAGGATTTA 

601 KCEIAGWGETKGTGNDTVLN 620 

GTGGCCTTTCTGAATGTTATCTCCAACCAGGAGTGTAACATCAAGCACCGAGGACGTGTG 

1861 + + + 7~+ + + 1920 

CACCGGAAAGACT7ACAATAGAGGTTGGTCCTCACATTGTAGTICGTGGCTCCTGCACAC 

621 VAFLNVISNQECNIKHRGRV 64 0 

CGGGAGAGTGAGATGTGCACTGAGGGACTGTTGGCCCCTGTGGGGGCCTGTGAGGGTGAC 

192i + +• + h 1980 

GCCCTCTCACTCTACACGTGACTCCCTGACAACCGGGGACACCCCCGGACACTCCCACTG 

641 RES EMCTEGLLAPVGACZGD 660 

TACGGGGGCCCACTTGCCTGCTTTACCCACAACTGCTGGGTCCTGGAAGGAATTATAATC q 
ATGCCCCCGGGTGAACGGACGAAATGGGTGTTGACGACCCAGGACCTTCCTTAATATTAG 
661 YGG PLAC FT K NCWVL EG I I I 63 0 

CCCAACCGAGTATGCGCAAGGTCCCGCTGGCCAGCTGTCTTCACGCGTGTCTCTGTGTTT 

2041 r ~ H H + _ r 2100 

GGGTTGGCTCATACGCGTTCCAGGGCGACCGGTCGACAGAAGTGCGCACAGAGACACAAA 
681 PNRVCAES R W PAVFT RV SVF 700 

G TGG ACT GGATT C AC AAGGT CAT G AGACT GGGTTAG 
CACCTGACCTAAGTGTTCCAGTACTCTGACCCAATC 
701 VDWIHKVMRLG* 711 
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60 



61 
13 

121 

38 



GAATTCCACCATGGGGTGGCTCCCACTCCTGCTGCTTCTGACTCAATGCTTAGGGGTCCC 
CTTAAGGTGGTACCCCACCGAGGGTGAGGACGACGAAGACTGAGTTACGAATCCCCAGGG 

MGWLPLLLLLTQCLGVP 17 

TGGGCAGCGC7CGCCATTGAATGACT7CCAAGTGCTCCGGGGCACAGAGCTACAGCACCT 

■ j j 4 ■ - J_^U 

ACCCGTCGCGAGCGGTAACTTACTGAAGGTTCACGAGGCCCCGTGTCTCGATGTCGTGGA 

gqrsplndfqvl'r^gtelqkl 37 



180 



GCT^^TGCGGTGGTGCCCGGGCCTTGGCAGGAGGATGTGGCAGATGCTGAAGAGTGTGC 

21* ~ + + + 

CGATGTACGCCACCACGGGCCCGGAACCGTCCTCCTACACCGTCTACGACTTCTCACACG 

L H A VVPGPWQEDVADAEEC A 57 



O 

jjj TGG7CGC7G7GGGCCCTTAA7GGA.CTGCCGGGCCTTCCAC ^ ^ q 

t 181 ACCAGCGACACCCGGGAATTACCT^ 

S 58 GRCGPLMDCRAFKYNVSSKG 77 

ft TTGCCAACTGCTGCCATGGACTCAACACTCGCCCCACACGAGGCTGCGGCGTTCTGGGCG 

r i= ?41 _^ +. -r + -r T JUU 

~ AACGGTTGACGACGGTACCTGAGTTGTGAGCGGGGTGTGCTCCGACGCCGCAAGACCCGC 

g ?8 CQLLPWTQHS PHTRLRRSGR 97 

Lii. 

fl CTGTGACCTCTTCCAGAAGAAAGACTACGTA.CGGACCTGCATCATGAACAATGGGGTTGo 

CI GACACTGGAGA^GGTCTTCTTTCTGATGCATGCCTGGACGTAGTACTTGTTACCCCAACC 

O 9 g CDLFQKKDYVRTCIMNNGVG 117 



361 



1 i 



GTACCGGGGCACCATGGCCACGACCGTGGGTGGCCTGCCCTGCCAGGCTTGGAGCCrtCAA 

* -r + + ^ 

CATGGCCCCGTGGTACCGGTGCTGGCACCCACCGGACGGGACGGTCCGAACCTCGGTGTx 



420 

Y R G T M A T T V G G L P C Q A W S H K 137 



GTTCCCGAATGATCACAAGTACACGCCCACTCTCCGGAATGGCCTGGAAGAGAACTTCTG 
4 2 1 CAAGGGCTTACTAGTGTTCATGTGCGGGTGAGAGGCCTTACCGGACCT 



480 



138 



NDKKYT?TLR.NGLEENrC 157 



481 



54C 



CC^TAArcCTGATGGCGACCCCGGAGGTCCTTGGTGCTACACAACAGACCCTGCTGTGCG 

, -i 

GGCATTGGGACTACCGCTGGGGCCTCCAGGAACCACGATGTGTTGTCTGGGACGACACGC 

153 R N ? D G D ? G G ? W C Y T T D ? A V R H 
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CTTCCAGAGCTGCGGCATCAPJVTCCTGCCSGGAGGCCGCGTGTGTCTGG7GCAATGGCGA 

041 GAAGGTCTCGACGCCGTAGTTTAGGACGGCCCTCCGGCGCACACAGACCACGTTACCGCT D °° 

178 FQSCGIKSCREAACVWCNGE 197 

GGAATACCGCGGCGCGGTAGACCGCACGGAGTCAGGGCGCGAGTGCCAGCGCTGGGATCT 

601 + -i + ~ 650 

CCTTATGGCGCCGCGGCATCTGGCGTGCC7CAGTCCCGCGCTCACGGTCGCGACCCTAGA 

198 EYRGAVDRTESGRECQRWDL 217 

TCAGCACCCGCACCAGCACCCCTTCGAGCCGGGCAAIGTTCCTCGACCAAGGTCTGGACGA 
6 °~ AGTCGTGGGCGTGGTCGTGGGGAAGCTCGGCCCGTTCAAGGAGCTGGTTCCAGACCTGCT ' 2 ° 
218 QHPKQHPFEPGKFLDQGLDD 237 

CAACTATTGCCGGAATCCTGACGG 

' 2 1 GTTGATAACGGCCTTAGGACTGCCC-AGGCTCGCCGGTACCACGATGTGATGCCTAGGCGT ' 3 ° 

23 8 NYCRNFDGSERPWCYTTDPQ 257 

GATCGAGCGAGAGTTCTGTGACCTCCCCCGCTGCGGGTCCGAGGCACAGCCCCGCCTCGA 
781 CTAGCTCGCTCTCAAGACACTGGAGGGGGCGACGCCCAGGCTCCGTGTCGGGGCGGAGCT " 4 ° 
258 IEREFCDLPRCGSEAQPRLE 277 

GGGCGGTGGCGGTTCTGGTGGCGGTGGC7CCGGCGGTGGCGGTTCTCTAGAGGGACAAAG 

841 + * + + + + 900 

CCCGCCACCGCCAAGACCACCGCCACCGAGGCCGCCACCGCCAAGAGATCTCCCTGTTTC 

27Q GGGGSGGGGSGGGGSLEGQR 297 

G AAAAG AAGAAAT AC AAT T CAT GAAT T C AAAAAAT C AGC AAAG AC T AC C C T AAT C AAAAT 
901 CTTTTCTTCTTTATGTTAAGTACTTAAGTTTTTTAGTCGTTTCTGATGGGATTAGTTTTA " °° 
298 KRRNTIHEFKKSAKTTLIKI 317 

AGATCCAGCACTGAAGATAAAAACCAAAAAAGTGAATA.CTGCAGACCAATGTGCTAATAG 
961 TCTAGGTCGTGACTTCTATTTTTGGTTTTTTCACTTATGAGGTCTGGTTACACGATTATC ~° 2 ° 
31 g DPALKIKTKKVNTADQCANR 337 

ATGTACTAGGAATAAAGGACTTCCATTCACTTGCAAGGCTTTTGTTTTTGATAAAGCAAG 

1021 T ACAT GAT CCTTATTTCCT G AAG G T AAG T G AAC G T T C C G AAAAC AAAAAC T AT T T C G T T C 
333 C T R N K G L P E T C K A F V F D K A R 357 

AAAACAA-TGCCTCTGGTTCCCC7GCAATAGCATGTCAAG7GGAGTGAAAAAAGAATTTGG 

1081 ^ 1K0 

TTTTGTTACGGAGA'^'s-AAGGGGAAGTTATCGTACA.GTTCACCTCA.CTTTTTTCTTAAACC 

35 8 K Q C L W F ? F N S M S SGVKKEFG 377 

CCATGAATTTGACCTCTATGAAAACAAAGACTACATTAGA 
1141 GGTACTTAAJ\CTGGAGATAC7777G77TC7GA7G7AATC7TTGACG7AGTAAGCA77TCC 
378 H H: F D L Y E M K D Y I R N C I I G K G 3 97 
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ACGCAGCTACAAGGGAACAGTATCTATCACTAAGAG7GGCATCAAATGTCAGCCCTGGAG 

1201 + -r -r + -r -r 1260 

TGCGTCGATGTTCCCTTGTCATAGATAGTGATTCTCACCGTAGTTTACAGTCGGGACCTC 
3 98 RSYKGTVSITKSGIKCQPWS 417 

TTCCATGATACCACACGAACACAGCTATCGGGGTAJiAGACCTACAGGAAAACTACTGTCG 
AAGGTACTATGGTGTGCTTGTGTCGATAGCCCCAJTTCTGGATGTCCTTTTGATGACAGC 
418 SMI?REKSYRGK"DLQENYCR 437 

AAATCCTCGAGGGGAAGAAGGGGGACCCTGGTGTTTCACAAGCAATCCAGAGGTACGCTA 
TTTAGGAGCTCCCCTTCTTCCCCCTGGGACCACAAAGTG7TCGTTAGGTCTCCATGCGAT 

o 

j\ 438 NPRGEEGGPWCFTSKPEVRY 4 57 

i 

O CGAAGTCTGTGACATTCCTCAGTGTTCAGAAGTTGAATGCATGACCTGCAATGGGGAGAG 
O " GCTTCAGACACTGTAAGGAGTCACAAGTCTTCAACTTACGTACTGGACGTTACCCCTCTC 

* TTATCGAGGTCTCATGGATCATACAGA.ATCAGGCAAGATTTGTCAGCGCTGGGATCATCA 
O AATAGCTCCAGAGTACCTAGTATGTCTTAGTCCGTTCTAAACAGTCGCGACCCTAGTAGT 



458 EVCDI PQCSEVECMTCNGES 477 



47 8 YRGLMDKTESGKICQRWDHQ 4 97 



& 

m 

GACACCACACCGGCACAAATTCTTGCCTGAAAGATATCCCGACAAGGGCTTTGATGATAA _ ^ 
CI " CTGTGGTGTGGCCGTGTTTAAGAACGGACTTTCTATAGGGCTGTTCCCGAAACTACTATT 

498 TPKRHKFLPERYPDKGFDDN 517 

TTATTGCCGCAATCCCGATGGCCAGCCGAGGCCATGGTGCTATACTCTTGACCCTCACAC ^ 
AATAACGGCGTTAGGGCTACCGGTCGGCTCCGGTACCACGATATGAGAACTGGGAGTGTG 
518 YCRNPDGQPRPWCYTLDPKT 537 

CCGCTGGGAGTACTGTGCAATTAAAACATGCGCTGA.CAAAGCTGACGACGACGACAAACA _^ ^ ^ 
GGCGACCCTCATGACACGTTAATTTTGTACGCGACTGTTTCGACTGCTGCTGCTGTTTGT 
" 538 R W E Y C A I K T C A D K A D D D D K H 5 57 

CCACCA.CCACCACCACCACTAGGGTCGAC 

168 1 + 170 9 

GGTGGTGGTGGTGGTGGTGATCCCAGCTG 

5 58 K K K H H H * 5 53 
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FIG 3 a 
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1 



121 



TTTTATCTAGGTCGTGACTTCTATTTTTGGTTTTTTCACTTATGACGTCTGGTTACACGA 



AATAGATGTACTAGGAATAAAGGACTTCCATTCACTTGCAAGGCTTTTGTTTTTGATAAA 

24 1 - + -■ ~- ~ * 

TTATCTACATGATCCTTATTTCCTGAAGGTAAGTGAACGTTCCGAAAACAAAAACTAT.TT 



% 301 CGTTCTTTTGTTACGGAGACCAAGGGGAA.GTTATCGTACAGTTCACCTCACTTTTTTCTT 



361 



421 



TGGAGTTCCATGATACCACACGAACACAGCTATCGGGGTAAAGACCTACAGGAJ\AACTAC 

481 + ~Z 

ACCTCAAGGTACTATGGTGTGCTTGTGTCGATAGCCGCATTTCTGGATGTCCTTTTGATo 

152 W S SMI ? H £ K S Y3.GKDLQENY 



60 
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FIG 3b 

GGATCCGCCAGCCCGTCCAGCAGCACCATGTGGGrGACCAAACTCCTGCCAGCCCTGCTG 
CCTAGGCGGTCGGGCAGGTCGTCGTGGTACACCCACTGGTTTGAGGACGGTCGGGACGAC 
1 MWVTKLLPALLli 

CTGCAGCATGTCCTCCTGCATCTCCTCCTGCTCCCCATCGCCATCCCCTATGCAGAGGGA 

61 - -r + -r + 120 

GACGTCGTACAGGAGGACGTAGAGGAGGACGAGGGGTAGCGGTAGGGGATACGTCTCCCT 

12 L Q~ K V L L H L L • L ■ L P- I -A I P" Y A' "E G\ 31 



180 



CA^iGGAAJiAGAAGAAJlTACAATTCATGAATTCAAJ^AA-TCAGCAA-AGACTACCCTAATC 

j i- -j J -r 

GTTTCCTTTTCTTCTTTATGTTAAGTACTTAA-GTTTTTTAGTCGTTTCTGATGGGATTAG 
O 32QRKRRNTIHEFKKSAKTTLI 51 

yi! 

S! & sjiATAGATCCAGCACTGAAGATAAAAACCAAAAAAGTGAATACTGCAGACCAATGTGCT 

p 181 

o 

rfj 52KIDPALK1KTKKVNTADQC 



240 



/ 1 



300 



O 

ifi 72 N R C T R N K G L P F T C K A F V F D K 91 

111 GCAAGAAAACAATGCCTCTGGTTCCCCTTCAATAGCATGTCAAGTGGAGTGAAAAAAGAA 



360 

92ARKQCLWFP FNSMSSGVKKE 111 



420 



TTTGGCCATGAATTTGACCTCTATGAA-AACAAAGACTA.CATTAGAAACTGCATCATTGGT 

_1 a. + ~ + 

AAACCGGTACTTAAACTGGAGATACTTTTGTTTCTGATGTAATCTTTGACGTAGTAACCA 

112FGHEFDLYEN-KDYIRNCIIG 131 



480 



AAAGGACGCAGCTA.CAAGGGAACAGTATCTATCACTAAGAGTGGCATCAAATGTCAGCCC 
TTTCCTGCGTCGATGTTCCCTTGTCATAGATAGTGATTCTCACCGTAGTTTACAGTCGGG 
132 KGRSYKGTVSITKSGIKCQ? 15 



54! 



(continued) 
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TGTCGAVITCCTCGAGGGGA-GAA^GGGGACCCTGGTGTTTCACAAGCAATCCA-GAGGTA 

541 -f- + "f" + 600 

ACAGCTTTAGGAGCTCCCCTTCTTCCCCCTGGGACCACAAAGTGTTCGTTAGGTCTCCA? 

17 2CRNPRGEEGGPWCFTSNPEV 191 

CGCTACGAAGTCTGTGACATTCCTCAGTGTTCAGAAGTTGAATGCATGACCTGCAATGGG 
GCGATGCTTCAGACACTGTAAGGAGTCACAAGTCTTCAACTTACGTACTGGACGTTACCC ° 
192RYEVCDIPQCSEVECMTCNG 211 

t 

^ ^ gagagttatcgaggtctcatggatcatacagaatcagggaagatttgtcagcgctgggat 
ctctcaatagctccagagtacctagtatgtcttagtccgttctaaacagtcgcgacccta 
212 esyrglmdhtesgkicqrwd 231 

catcagacaccacaccggcacaaattcttgcctgaaagatatcccgacaagggctttgat 
i) gtagtctgtggtgtggccgtgtttaagaacggactttctatagggctgttcccgaaacta 
:;; 232 hqtphrfkflperyfdxgfd 251 

j] _ ^ gataattattgccggaatcccgatggccagccgaggccatggtggtatactcttgaccct 
0 ctattaataacggcgttagggctaccggtcggctccggtaccacgatatgagaactggga 
" 252 dnycrnpdgqprpwcytldp 271 

cacacccgctgggagtactgtgcaattaaaacatgcgctgacaaagcttcgggcggtggc 

;; ; 84 1 -h -r- ~ -r- 4- 900 

B GTGTGGGCGACCCTCATGACACGTTAATTTTGTACGCGACTGTTTCGAAGCCCGCCACGG 

'=J 272 HTRWEYCAIKTCADKASGGG 291 

% GGTTCTGGTGGGGGTGGCTCGGGCGGTGGCGGTTCTCTAGAGGGACAAAGGAAAAGAAGA 

==•■ SQ1 4- n -; -r + 960 

CGAAGAGGACCGCCACCGAGGCCGCCACCGCCAAGAGATCTCCGTGTTTCCT.TTTCTTC7 

292 GSGGGGSGGGGS LEGQRKRR 311 

AATACAATTCATGAATTCAAAAAATCAGCAAAGACTACCCTAATCAAAATAGATCCAGCA 

961 + + + -r + 1020 

TTATGTTAAGTACTTAAGTTTTTTAGTCGTTTCTGATGGGArTAGTTTTATCTAGGTCGT 

312 NTIHEFKKSAKTTLIKIDPA 331 

^ CTGAAGATAAAAACCAAAAAAGTGAATACTGCAGACCAATGTGCTAATAGATGTACTAGG ^ ^ ^ ^ 

GACTTGTATTTTTGGTTTTTTCACTTATGACGTCTGGTTACACGATTATCTACATGATCC 
332 L K I KTKKVNTADQC A M R C T ?, 351 

AATAA^GACT7CCATTCACT?GCA^GGCTTTTGTTTTTGATAAAGGA^GAAA\CAATGC 

106 1 11^0 

T T AT T T C C T G AAG G T AA.G T G AAC G T T C C G AAA-.C AAAAA C T AT TTCGTTCTTTTGTT AC G 

352 N KG L ? FT C K A F V F D X A R K Q C 371 

CTCTGGTTCCGCTTCAATAGCATGTCAAGTGGAGTGAAAAAAGAAT7TGGCCATGAATTT 

+ 4. + -r 1200 

GAGACCAAGGGGAAGTTATCGTACAGTTCACCTCACTTTTTTCTTAA=..CCGGTACTTAAA. 

37 2 L W F ? FN S MS S G V K K F F G K F F 3 91 
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(continued) 

GACCTCTATGAAAACAAAGACTACATTAGAAACTGCATCATTGGTAAAGGACGCAGCTAC 

CTGGAGATACTTTTGTTTCTGATGTAATCTTTGACGTAGTAACCATTTCCTGCGTCGA7G 1 2 6 ° 

DLYENKDYIRNCIIGKGRSY 411 

TTCCCTTGTCATAGATAGTGATTCTCACCGTAGm 1320 

KGTVSITKSGIKC; J2PWSSMI 431 

EE AC ^EE AACACAGCTATCGGGGTAA ^^ 

GGTGTGCTTGTGTCGATAGCCCCATTTCTGGATGTCCTTTTGATGACAGCTTTAGGAGCT ° 

PKEHSYRGKDLQENYCRNPR 4 51 

GGGG^AGAAGGGGGACCCTGGTGTTTCACAAGCAATCCA.GAGGTACGCTACGAAGTCTGT 

CCCCTTCTTCCCCCTGGGA.CCACAAAGTGTTCGTTAGGTCTCCATGCGATGCTTCAGACA 1 4 4 ° 

GEEGGPWCFTSNPEVRYEVC 471 

E A E A 2^EEZ CA EiEiEE AGAAGTTGA - ATGCATGACC 

CTGTAAGGAGTCACAAGTCTTCAACTTACGTACTGGACGTTACCCCTCTCAATAGCTcS 1 5 ° ° 

DIPQCSEVECMTCNGESYRG- 491 

GAGTACCTAGTA.TGTCTTAGTCCGTTCTAAACAGTCGCGACCCTAGTAGTCTGTGGTGTG 

LMDETESGKICQRWDHQTPH 511 

EEEE A E AAAT EE TTGCCTGAA_AGA ^^ 

GCCGTGTTTAAGAAXGGACTTTCTATAGGGCTGTTCCCGAAACTACTATTA^ATA^CGGCG 1 ^ ° 

RHKFLPERYPDKGFDDNYCR 531 

AATCCCGATGGCC^GCCGAGGCCA.TGGTGCTATACTCTTGACCCTCACACCCGCTGGGAG 

TTAGGGCTACCGGTCGGCTCCGGTACCACGATATGAGAACTGGGAGTGTGGGCGACCCTC 1 68 ° 

NPDGQPRPWCYTLDPETRWE 551 



1740 



TACTGTGCAATTAAAACATGCGCTGACA^^ 

ATGACACG7TAATTTTGTACGCGACTGTTTCGACTGCTGCTGCTGTTTGTGGTGGTGGTG 
YCAIKTCADKADDDDK H K H H 571 

CACCACCACTAGGGTCGAC 
— i75 q 

GTGGTGGTGATCGCAGCTG 

H H H * 57 4 
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